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(54) Title: DNA MOLECULES ENCODING HUMAN NHL. A DNA HELICASE 

^ (57) Abstract: The present invention disclosed isolated nucleic acid molecules (polynucleotides) which encode NHL, a putative 
DNA helicase. The present invention in turn relates to recombinant vectors and recombinant hosts which contain a DNA fragment 
Q encoding NHL, substantially purified forms of associated NHL, associated mutant proteins, and methods associated with identifying 
J> compounds which modulate NHL, which will be useful in the treatment of various neoplastic disorders. Both a genomic clone 
^ containing regulatory and intron sequences, as well as the exon structure and open reading frame of human NHL are disclosed. 
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5 

TITLE OF THE INVENTION 

DNA MOLECULES ENCODING HUMAN NHL, A DNA HEUCASE 

10 CROSS-REFERENCE TO RELATED APPUCATIONS 

This application claims the benefit, under 35 U.S.C. §119(e), of U.S. 
provisional application 60/169,970 filed December 9, 1999. 

STATEMENT REGARDING FEDERALLY-SPONSORED R&D 
15 Not Applicable 

REFERENCE TO MICROHCHE APPENDIX 
Not Applicable 

20 

FIELD OF THE INVENTION 

The present invention relates in part to isolated nucleic acid molecules 
(polynucleotides) which encode NHL, a putative DNA helicase. The present 
invention also relates to recombinant vectors and recombinant hosts which contain a 

25 DNA fragment encoding NHL, substantially purified forms of associated NHL, 
associated mutant proteins, and methods associated with identifying compounds 
which modulate NHL, which will be useful in the treatment of various neoplastic 
disorders, given that this gene is located at 20ql3-3 and immediately adjacent to 
M68/DcR3, which is involved in tumor growth. Also included within the present 

30 invention is a human genomic fragment representing this portion of the human 
genome, along with three additional genes (M68/DcR3, SCLIP, and ARP). 
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BACKGROUND OF THE INVENTION 

Naumovski et al. (1985, MoL Cell Biol. 5:17-26; Reynolds et al. (1985 Nucleic 
5 Acid Res 13:2357-2372) and Weber et al. (1990 EMBO J. 9: 1437-1447) disclose 

members of the RAD3/ERCC2 gene family of DNA helicases. 

It is known that several chemotherapeutic agents inhibit helicases, including 

actinomycin CI, daunorubicin and nogalamycin (Tuteja, et al., 1997, Biochem. 

Biophys, Res. Comm. 236(3):636-640), and a prostate cancer drug, CI-958 (Lun, et 
10 al.,1998. Cancer Chemother. Pharmacol. 42(6):447-453). In addition, some 

topoisomerases have been shown to have anti-cancer activity. 

Despite the identification of the aforementioned helicase-encoding genes and 

chemotherapeutic agents, it would be advantageous to identify additional genes which 

reside within chromosomal regions associated with a disease state such as cancer as 
15 well as a gene which encodes a type of protein which may be associated with that 

disease. The present invention addresses and meets this need by disclosing a DNA 

molecule encoding a DNA helicase with a chromosomal location suggestive of 

association with cancer. 

20 SUMMARY OF THE INVENTION 

The present invention relates to an isolated or purified nucleic acid molecule 
(polynucleotide) which encodes a novel mammalian DNA helicase. 

The present invention also relates to an isolated nucleic acid molecule 
(polynucleotide) which encodes mRNA which expresses a novel human DNA 
25 helicase, NHL. 

A preferred aspect of the present invention relates to an isolated or purified 
DNA molecule which encodes human NHL, the nucleotide sequence as set forth in 
Figure lA-B and SEQ ID N0:1. 

The present invention also relates to biologically active fragments or mutants 
30 of SEQ ID NO: 1 which encode a mRNA molecule expressing a novel DNA helicase, 
NHL, Any such biologically active fragment and/or mutant will encode either a 
protein or protein fragment which at least substantially mimics the biological 
properties of the human NHL protein disclosed herein in Figure 2 and as set forth as 
SEQ ID N0:2. Any such polynucleotide includes but is not necessarily limited to 
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nucleotide substitutions, deletions, additions, amino-terminal tmncations and carboxy- 

terminal truncations such that these mutations encode mRNA which express a 

functional NHL protein in a host cell, so as to be useful for screening for agonists 

and/or antagonists of NHL activity. 
5 The present invention also relates to recombinant vectors and recombinant 

hosts, both prokaryotic and eukaryotic, which contain the substantially purified 

nucleic acid molecules disclosed throughout this specification. 

The present invention also relates to a substantially purified form of a human 

NHL protein which comprises the amino acid sequence disclosed in Figure 2 and set 
10 forth as SEQ ID N0:2. 

A prefened aspect of this portion of the present invention is a NHL protein 

which consists of the amino acid sequence disclosed in Figure 2 and set forth as SEQ 

ID N0:2. 

Another preferred aspect of the present invention relates to a substantially 
15 purified NHL protein, preferably a human NHL protein, obtained from a recombinant 
host cell containing a DNA expression vector comprises a nucleotide sequence as set 
forth in SEQ ID N0:1 and expresses the respective NHL protein. It is especially 
preferred is that the recombinant host cell be a eukaryotic host cell, such as a 
mammalian cell line. 

20 The present invention also relates to biologically active fragments and/or 

mutants of a NHL protein comprising the amino acid sequence as set forth in SEQ ID 
N0:2, including but not necessarily limited to amino acid substitutions, deletions, 
additions, amino terminal truncations and carboxy-terminal truncations such that these 
mutations provide for proteins or protein fragments of diagnostic, therapeutic or 

25 prophylactic use and would be useful for screening for selective modulators, including 
but not limited to agonists and/or antagonists for human NHL pharmacology. 

A preferred aspect of the present invention is disclosed in Figure 2 and is set 
forth as SEQ ID N0:2, a respective amino acid sequence which encodes human NHL. 
Characterization of one or more of these DNA helicase-like proteins allows for 

30 screening methods to identify novel NHL modulators that may be useful in the 
treatment of human neoplastic disorders. The modulators selected through such 
screening and selection protocols may be used alone or in conjunction with other 
cancer therapies. As noted above, heterologous expression of a NHL protein will 
allow the pharmacological analysis of compounds which modulate NHL activity and 
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hence may be useful in various cancer therapies. To this end, heterologous cell lines 
expressing a NHL protein can be used to establish functional or binding assays to 
identify novel NHL modulators. 

The present invention also relates to polyclonal and monoclonal antibodies 
5 raised in response to either the NHL or a biologically active fragment of NHL. 

The present invention relates to transgenic mice comprising altered genotypes 
and phenotypes in relation to NHL and its in vivo activity. 

The present invention also relates to NHL fusion constructs, including but not 
limited to fusion constructs which express a portion of the NHL protein linked to 

10 various markers, including but in no way limited to GI^P (Green fluorescent protein), 
the MYC epitope, and GST. Any such fusion constructs may be expressed in the cell 
line of interest and used to screen for NHL modulators. 

Therefore, the present invention relates to methods of expressing mammalian 
NHL, and preferably human NHL, biological equivalents disclosed herein, assays 

15 employing these gene products, recombinant host cells which comprise DNA 
constructs which express these proteins, and compounds identified through these 
assays which act as agonists or antagonists of NHL activity. 

The present invention also relates to the isolated genomic sequence which 
comprises SEQ ED N0:1, a 115 kb genomic fragment set forth herein as SEQ ID 

20 N0:3. As especially preferred aspect of this portion of the invention is the region of 
the genomic fragment of SEQ ID N0:3 which comprises the regulatory and coding 
regions of human NHL, as well as intervening sequences (introns). This 115 kb 
fragment contains at least the coding region of four genes, NHL, M68/DcR3, SCLIP 
and ARP. As discussed herein, it has been shown that this region of chromosome 20 

25 is associated with tumor growth. Therefore, an aspect of this invention also comprises 
the use of one or more regions of this 1 15 kb genomic sequence to identify 
compounds which up or downregulate expression of one or more of the genes 
localized within this 1 15 kb region, wherein this up or down regulation results in an 
interference of tumor growth. For example, a transcription element of one of these 

30 four genes may be responsible for M68/DcR3 ( and/or NHL) overexpression in 

tumors, and if M68 or NHL overexpression in tumors has a caustic role, blockage of 
M68/DcR3 or NHL overexpression in tumors by interfering with this transcription site 
will be useful. 
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It is an object of the present invention to provide an isolated nucleic acid 
molecule (e.g., SEQ ED N0:1) which encodes novel form of human NHL, or 
fragments, mutants or derivatives of human NHL as set forth in Figure 2 and SEQ ID 
N0:2. Any such polynucleotide includes but is not necessarily limited to nucleotide 
5 substitutions, deletions, additions, amino-terminal truncations and carboxy-terminal 
truncations such that these mutations encode mRNA which express a protein or 
protein fragment of diagnostic, therapeutic or prophylactic use and would be useful 
for screening for selective modulators of human NHL activity. 

It is a further object of the present invention to provide the mammalian, and 
10 especially human, NHL proteins or protein fragments encoded by the nucleic acid 
. molecules referred to in the preceding paragraph. 

It is a further object of the present invention to provide recombinant vectors 
and recombinant host cells which comprise a nucleic acid sequence encoding 
mammalian, and especially human, NHL protein and biological equivalent thereof. 
15 It is an object of the present invention to provide a substantially purified form 

of human NHL, as set forth in Figure 2 and SEQ ID N0:2. 

Is another object of the present invention to provide a substantially purified 
recombinant form of a NHL protein which has been obtained from a recombinant host 
cell transformed or transfected with a DNA expression vector which comprises and 
20 appropriately expresses a complete open reading frame as set forth in SEQ ED NO: 1, 
resulting in a functional, processed form of NHL. It is especially preferred is that the 
recombinant host cell be a eukaryotic host cell, such as a mammalian cell line. 

It is an object of the present invention to provide for biologically active 
fragments and/or mutants of mammalian, and especially human, NHL, such as set 
25 forth in SEQ ID N0:2, including but not necessarily limited to amino acid 

substitutions, deletions, additions, amino terminal truncations and carboxy-terminal 
truncations such that these mutations provide for proteins or protein fragments of 
diagnostic, therapeutic and/or prophylactic use. 

It is also an object of the present invention to use NHL proteins or biological 
30 equivalent to screen for modulators, preferably selective modulators,, of human NHL 
activity. Any such compound may be useful in screening for and selecting compounds 
active against human neoplastic disorders. 

As used herein, "substantially free from other nucleic acids" means at least 
90%, preferably 95%, more preferably 99%, and even more preferably 99.9%, free of 
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Other nucleic acids. Thus, a human NHL DNA preparation that is substantially free 
fronn other nucleic acids will contain, as a percent of its total nucleic acid, no more 
than 10%, preferably no more than 5%, more preferably no more than 1%, and even 
more preferably no more than 0.1%, of non-NHL nucleic acids. Whether a given 

5 NHL DNA preparation is substantially free from other nucleic acids can be 

determined by such conventional techniques of assessing nucleic acid purity as, e.g., 
agarose gel electrophoresis combined with appropriate staining methods, e.g., 
ethidium bromide staining, or by sequencing. 

As used herein, "substantially free from other proteins" or "substantially 

10 purified" means at least 90%, preferably 95%, more preferably 99%, and even more 
preferably 99.9%, free of other proteins. Thus, a NHL protein preparation that is 
substantially free from other proteins will contain, as a percent of its total protein, no 
more than 10%, preferably no more than 5%, more preferably no more than 1%, and 
even more preferably no more than 0.1%, of non-NHL proteins. Whether a given 

15 NHL protein preparation is substantially free from other proteins can be determined 
by such conventional techniques of assessing protein purity as, e.g., sodium dodecyl 
sulfate polyacrylamide gel electrophoresis (SDS-PAGE) combined with appropriate 
detection methods, e.g., silver staining or immunoblotting. As used interchangeably 
with the terms "substantially free from other proteins" or "substantially purified", the 

20 terms "isolated NHL protein" or "purified NHL protein" also refer to NHL protein that 
has been isolated from a natural source. Use of the term "isolated" or "purified" 
indicates that NHL protein has been removed from its normal cellular environment. 
Thus, an isolated NHL protein may be in a cell-free solution or placed in a different 
cellular environment from that in which it occurs naturally. The term isolated does 

25 not imply that an isolated NHL protein is the only protein present, but instead means 
that an isolated NHL protein is substantially free of other proteins and non-amino acid 
material {e.g., nucleic acids, lipids, carbohydrates) naturally associated with the NHL 
protein in vivo. Thus, a NHL protein that is recombinantly expressed in a prokaryotic 
or eukaryotic cell and substantially purified from this host cell which does not 

30 naturally {i.e., without intervention) express this protein is of course "isolated NHL 
protein" under any circumstances referred to herein. As noted above, a NHL protein 
preparation that is an isolated or purified NHL protein will be substantially free from 
other proteins will contain, as a percent of its total protein, no more than 10%, 
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preferably no more than 5%, more preferably no more than 1%, and even more 
preferably no more than 0.1%, of non-NHL proteins. 

As used interchangeably herein, "functional equivalent" or "biologically active 
equivalent" means a protein which does not have exactly the same amino acid 
5 sequence as naturally occurring NHL, due to alternative splicing, deletions, mutations, 
substitutions, or additions, but retains substantially the same biological activity as 
NHL. Such functional equivalents will have significant amino acid sequence identity 
with naturally occurring NHL and genes and cDNA encoding such functional 
equivalents can be detected by reduced stringency hybridization with a DNA sequence 

10 encoding naturally occurring NHL. For example, a naturally occurring NHL disclosed 
herein comprises the amino acid sequence shown as SEQ JD N0:2 and is encoded by 
SEQ ID N0:1. A nucleic acid encoding a functional equivalent has at least about 
50% identity at the nucleotide level to SEQ ID NO: 1 . 

As used herein, "a conservative amino acid substitution" refers to the 

15 replacement of one amino acid residue by another, chemically similar, amino acid 
residue. Examples of such conservative substitutions are: substitution of one 
hydrophobic residue (isoleucine, leucine, valine, or methionine) for another; 
substitution of one polar residue for another polar residue of the same charge (e.g., 
arginine for lysine; glutamic acid for aspartic acid). 

20 As used herein, the term "mammalian" will refer to any mammal, including a 

human being. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 A-B shows the nucleotide sequence which comprises the open reading 
25 frame which encodes human NHL, the nucleotide sequence set forth as SEQ ID N0:1. 
The initiating Met residue (ATG) and the stop codon (TAG) are underlined. 

Figure 2 shows the amino acid sequence of human NHL as set forth in SEQ ID 

N0:2. 

Figure 3 shows the alignment of amino acid sequences of human NHL to 
30 ERCC2/RAD3 gene family members. Rep D {Dictyosteliem discoideum); RAD 3 
(S. cerevisiae)\ RAD 15 (5. pombe) and XP_GroupD {Homo sapien). 

Figure 4 shows Northern analysis of NHL expression in multi-human tissues. 

Figure 5A-B show the genomic structure of the NHL gene (Figure 5A) and the 
entire 115 kb genomic region (Figure 5B) containing the NHL, M68/DcR3, SCLff 
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and ARP genes. 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention relates to an isolated or purified nucleic acid molecule 
5 (polynucleotide) which encodes a novel mammalian DNA helicase. An especially 
preferred aspect of this invention relates to an isolated nucleic acid molecule 
(polynucleotide) which encodes mRNA which expresses a novel human DNA 
helicase, NHL. 

The gene M68/DcR3 is a secreted TNFR member that is overexpressed in a 

10 number of human tumors. M68/DcR3 is located at 20ql3.3, a known site that is 
associated with frequent gene amplification in cancer. M68/DcR3 protein binds, to 
FASL and inhibit FAS mediated apoptosis. Thus, genes tightly linked to M68/DcR3 
may be coregulated (e.g. co overexpressed and/or amplified in tumors). During the 
course of cloning the genomic M68/DcR3 fragment and identifying genes that are 

15 linked to M68/DcR3 at 20ql3.3, three genes, including a novel gene that is similar to 
the Rad3/ERCC2 helicase family, were identified (termed NHL) in the immediately 
adjacent (overlapping) region. Given NHL's chromosomal location and the frequent 
association of DNA helicases with human genetic disorders (mutations in DNA 
helicases have been found associated with multiple diseases, including xeroderma 

20 pigmentosum, Cockayne's syndrome, Bloom's syndrome, and Werner's syndrome), 
NHL is a candidate for contribution to certain human neoplastic disorders. To this 
end, the genomic clone for this gene is disclosed and the complete sequence is 
determined. The transcript was identified through exon prediction using GRAIL2 and 
sequence alignment to a contiguous 4.5 kilobase region of chromosome 4 (88% 

25 sequence identity). The complete exon structure of NHL was subsequently confirmed 
by RT-PCR analysis. Multiple sequence alignment of NHL to known helicases 
showed that NHL contains all the seven critical helicase domains. BLAST analysis of 
. the predicted 1,219 amino acid sequence revealed an approximately 26% sequence 
identity and 48% sequence similarity to the RAD3/ERCC2 gene family of DNA 

30 helicases (Naumovski et al., 1985 MoL Cell Biol 5:17-26; Reynolds et al., 1985 
^MddcAczd/?e^ 13:2357-72; Weber etal., 1990 fikfSO 7. 9:1437-1447). The 
mRNA expression pattern of NHL was also examined in multiple human tissues. 
Radiation hybrid chromosomal mapping reconfirms that it is linked to M68/DcR3 
locus. 
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A preferred aspect of the present invention relates to an isolated or purified 
DNA molecule which encodes human NHL, the nucleotide sequence as set forth in 
Figure lA-B and SEQ ID N0:1, which is as follows: 





AGTGAGCCCT 


GCTGCCAGCC 


AGTGCCGGGT 


GCTGGGGACT 


CAGGGAGGCC 


CGCCGGGACC 


5 


ACTGCGGGAC 


AGTGAGCCGA 


GCAGAAGCTG 


GAACGCAGGA 


GAGGAAGGAG 


AGGGGGCGGT 




CAGGGCTCTC 


AGGAGCCGGG 


TCCTGGGCAA 


GGCGCAGCCG 


TTTTCAAATT 


TTCAGGAAAG 




CGGTCGGCTC 


ACACTCGAGC 


AGTAAAAAGA 


TGCCTCTGGG 


GAGGAGGCCC 


GTGCAGCTCT 




CCGGGCAATG 


GTGGTGGCTC 


GGCCTAGAGA 


GGCGGTAGTG 


GAACGCAGAC 


CCTGGTGGGG 




GAATGACATC 


AAGGGAGGAG 


ACGGGCGGGA 


CCCCAGATTT 


CTGCCTGTGG 


GCGATGGAAG 


10 


TGAGGTTCAC 


TGGCCAGCGG 


AGCCGGACAC 


AGAACGCGCA 


AAACGCCGTG 


TAGGCCTGGA 




GGAGCCGAAG 


AGCAGGCGGA 


CCCCCTCCGC 


GGGGGAACAG 


TTTCCGCCGG 


GAGCACAAAG 




CAACGGACCG 


GAAGTGGGGG 


GCGGAAGTGC 


AGTGGGCTCA 


GCGCCGACTG 


CGCGCCTCTG 




CCCGCGAAAA 


CTCTGAGCTG 


GCTGACAGCT 


GGGGACGGGT 


GGCGGCCCTC 


GACTGGAGTC 




GGTTGAGTTC 


CTGAGGGACC 


CCGGTTCTGG 


AAGGTTCGCC 


GCGGAGACAA 


GTGAGGAGTC 


15 


TGTGCCATAG 


GGATTCTCGA 


AGAGAACAGC 


GTTGtGTCCC 


AGTGCACATC 


CTCGCATCGC 




TTACCAGGAG 


TGCCCGAGAC 


CCTAAGATGT 


TCGGAGTGGT 


TTTTTCGCAC 


AGACCCGAAT 




AGCCTGCCCC 


TCAGCCACGC 


TCTGTGCCCT 


TCTGAGAACA 


GGCTGATATG 


CCCAAGATAG 




TCCTGAATGG 


TGTGACCGTA 


GACTTCCCTT 


TCCAGCCCTA 


CAAATGCCAA 


CAGGAGTACA 




TGACCAAGGT 


CCTGGAATGT 


CTGCAGCAGA 


AGGTGAATGG 


CATCCTGGAG 


AGCCCTACGG 


20 


GTACAGGGAA 


GACGCTGTGC 


CTGCTGTGCA 


CCACGCTGGC 


CTGGCGAGAA 


CACCTCCGAG 




ACGGCATCTC 


TGCCCGCAAG 


ATTGCCGAGA 


GGGCGCAAGG 


AGAGCTTTTC 


CCGGATCGGG 




CCTTGTCATC 


CTGGGGCAAC 


GCTGCTGCTG 


CTGCTGGAGA 


CCCCATAGCT 


TGCTACACGG 




ACATCCCAAA 


GATTATTTAC 


GCCTCCAGGA 


CCCACTCGCA 


ACTCACACAG 


GTCATCAACG 




AGCTTCGGAA 


CACCTCCTAC 


CGGCCTAAGG 


TGTGTGTGCT 


GGGCTCCCGG 


GAGCAGCTGT 


25 


GCATCCATCC 


TGAGGTGAAG 


AAACAAGAGA 


GTAACCATCT 


ACAGATCCAC 


TTGTGCCGTA 




AGAAGGTGGC 


AAGTCGCTCC 


TGTCAtTTCT 


ACAACAACGT 


AGAAGAAAAA 


AGCCTGGAGC 




AGGAGCTGGC 


CAGCCCCATC 


CTGGACATTG 


AGGACTTGGT 


CAAGAGCGGA 


AGCAAGCACA 




GGGTGTGCCC 


TTACTACCTG 


TCCCGGAACC 


TGAAGCAGCA 


AGCCGACATC 


ATATTCATGC 




CGTACAATTA 


CTTGTTGGAT 


GCCAAGAGCC 


GCAGAGCACA 


CAACATTGAC 


CTGAAGGGGA 


30 


CAGTCGTGAT 


CTTTGACGAA 


GCTCACAACG 


TGGAGAAGAT 


GTGTGAAGAA 


TCGGCATCCT 




TTGACCTGAC 


TCCCCATGAC 


CTGGCTTCAG 


GACTGGACGT 


CATAGACCAG 


GTGCTGGAGG 




AGCAGACCAA 


GGCAGCGCAG 


CAGGGTGAGC 


CCCACCCGGA 


GTTCAGCGCG 


GACTCCCCCA 




GCCCAGGGCT 


GAACATGGAG 


CTGGAAGACA 


TTGCAAAGCT 


GAAGATGATC 


CTGCTGCGCC 




TGGAGGGGGC 


CATCGATGCT 


GTTGAGCTGC 


CTGGAGACGA 


CAGCGGTGTC 


ACCAAGCCAG 
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GGAGCTACAT 


CTTTGAGCTG 


TTTGCTGAAG 


CCCAGATCAC 


GTTTCAGACC 


AAGGGCTGCA 


TCCTGGACTC 


GCTGGACCAG 


ATCATCCAGC 


ACCTGGCAGG 


ACGTGCTGGA 


GTGTTCACCA 


ACACGGCCGG 


ACTGCAGAAG 


CTGGCGGACA 


TTATCCAGAT 


TGTGTTCAGT 


GTGGACCCCT 


CCGAGGGCAG 


CCCTGGTTCC 


CCAGCAGGGC 


TGGGGGCCTT 


ACAGTCCTAT 


AAGGTGCACA 


TCCATCCTGA 


TGCTGGTCAC 


CGGAGGACGG 


CTCAGCGGTC 


TGATGCCTGG 


AGCACCACTG 


CAGCCAGAAA 


GCGAGGGAAG 


GTGCTGAGCT 


ACTGGTGCTT 


CAGTCCCGGC 


CACAGCATGC 


ACGAGCTGGT 


CCGCCAGGGC 


GTCCGCTCCC 


TCATCCTTAC 


CAGCGGCACG 


CTGGCCCCGG 


TGTCCTCCTT 


TGCTCTGGAG 


ATGCAGATCC 


CTTTCCCAGT 


CTGCCTGGAG 


AACCCACACA 


TCATCGACAA 


GCACCAGATC 


TGGGTGGGGG 


TCGTCCCCAG 


AGGCCCCGAT 


GGAGCCCAGT 


TGAGGTCCGC 


GTTTGACAGA 


CGGTTTTCCG 


AGGAGTGCTT 


ATCCTCCCTG 


GGGAAGGCTC 


TGGGCAACAT 


CGCCCGCGTG 


GTGCCCTATG 


GGCTCCTGAT 


CTTCTTCCCT 


TCCTATCCTG 


TCATGGAGAA 


GAGCCTGGAG 


TTCTGGCGGG 


CCCGCGACTT 


GGCCAGGAAG 


ATGGAGGCGC 


TGAAGCCGCT 


GTTTGTGGAG 


CCCAGGAGCA 


AAGGCAGCTT 


CTCCGAGACC 


ATCAGTGCTT 


ACTATGCAAG 


GGTTGCCGCC 


CCTGGGTCCA 


CCGGCGCCAC 


CTTCCTGGCG 


GTCTGCCGGG 


GCAAGGCCAG 


CGAGGGGCTG 


GACTTCTCAG 


ACACGAATGG 


CCGTGGTGTG 


ATTGTCACGG 


GCCTCCCGTA 


CCCCCCACGC 


ATGGACCCCC 


GGGTTGTCCT 


CAAGATGCAG 


TTCCTGGATG 


AGATGAAGGG 


CCAGGGTGGG 


GCTGGGGGCC 


AGTTCCTCTC 


TGGGCAGGAG 


TGGTACCGGC 


AGCAGGCGTC 


CAGGGCTGTG 


AACCAGGCCA 


TCGGGCGAGT 


GATCCGGCAC 


CGCCAGGACT 


ACGGAGCTGT 


CTTCCTCTGT 


GACCACAGGT 


TCGCCTTTGC 


CGACGCAAGA 


GCCCAACTGC 


CCTCCTGGGT 


GCGTCCCCAC 


GTCAGGGTGT 


ATGACAACTT 


TGGCCATGTC 


ATCCGAGACG 


TGGCCCAGTT 


CTTCCGTGTT 


GCCGAGCGAA 


CTATGCCAGC 


GCCGGCCCCC 


CGGGCTACAG 


CACCCAGTGT 


GCGTGGAGAA 


GATGCTGTCA 


GCGAGGCGAA 


GTCGCCtGGC 


CCCTTCTTCT 


CCACCAGGAA 


AGCTAAGAGT 


CTGGACCTGC 


ATGTCCCCAG 


CCTGAAGCAG 


AGGTCCTCAG 


GGTCACCAGC 


TGCCGGGGAC 


CCCGAGAGTA 


GCCTGTGTGT 


GGAGTATGAG 


CAGGAGCCAG 


TTCCTGCCCG 


GCAGAGGCCC 


AGGGGGCTGC 


TGGCCGCCCT 


GGAGCACAGC 


GAACAGCGGG 


CGGGGAGCCC 


TGGCGAGGAG 


CAGGCCCACA 


GCTGCTCCAC 


CCTGTCCCTC 


CTGTCTGAGA 


AGAGGCCGGC 


AGAAGAACCG 


CGAGGAGGGA 


GGAAGAAGAT 


CCGGCTGGTC 


AGCCACCCGG 


AGGAGCCCGT 


GGCTGGTGCA 


CAGACGGACA 


GGGCCAAGCT 


CTTCATGGTG 


GCCGTGAAGC 


AGGAGTTGAG 


CCAAGCCAAC 


TTTGCCACCT 


TCACCCAGGC 


CCTGCAGGAC 


TACAAGGGTT 


CCGATGACTT 


CGCCGCCCTG 


GCCGCCTGTC 


TCGGCCCCCT 


CTTTGCTGAG 


GACCCCAAGA 


AGCACAACCT 


GCTCCAAGGC 


TTCTACCAGT 


TTGTGCGGCC 


CCACCATAAG 


CAGCAGTTTG 


AGGAGGTCTG 


TATCCAGCTG 


ACAGGACGAG 


GCTGTGGCTA 


TCGGCCTGAG 


CACAGCATTC 


CCCGAAGGCA 


GCGGGCACAG 


CCGGTCCTGG 


ACCCCACTGG 


AAGAACGGCG 


CCGGATCCCA 


AGCTGACCGT 


GTCCACGGCT 


GCAGCCCAGC 


AGCTGGACCC 


CCAAGAGCAC 


CTGAACCAGG 
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GCAGGCCCCA 


CCTGTCGCCC 


AGGCCACCCC 


CAACAGGAGA 


CCCTGGCAGC 


CAACCACAGT 


GGGGGTCTGG 


AGTGCCCAGA 


GCAGGGAAGC 


AGGGCCAGCA 


CGCCGTGAGC 


GCCTACCTGG 


CTGATGCCCG 


CAGGGCCCTG 


GGGTCCGCGG 


GCTGTAGCCA 


ACTCTTGGCA 


GCGCTGACAG 


CCTATAAGCA 


AGACGACGAC 


CTCGACAAGG 


TGCTGGCTGT 


GTTGGCCGCC 


CTGACCACTG 


CAAAGCCAGA 


GGACTTCCCC 


CTGCTGCACA 


GGTTCAGCAT 


GTTTGTGCGT 


CCACACCACA 


AGCAGCGCTT 


CTCACAGACG 


TGCACAGACC 


TGACCGGCCG 


GCCCTACCCG 


GGCATGGAGC 


CACCGGGACC 


CCAGGAGGAG 


AGGCTTGCCG 


TGCCTCCTGT 


GCTTACCCAC 


AGGGCTCCCC 


AACCAGGCCC 


CTCACGGTCC 


GAGAAGACCG 


GGAAGACCCA 


GAGCAAGATC 


TCGTCCTTCC 


TTAGACAGAG 


GCCAGCAGGG 


ACTGTGGGGG 


CGGGCGGTGA 


GGATGCAGGT 


CCCAGCCAGT 


CCTCAGGACC 


TCCCCACGGG 


CCTGCAGCAT 


CTGAGTGGGG 


CCTCTAGGAT 


GTGCCCAGCC 


TGCCACACCG 


CCTCCAGGAA 


GCAGAGCGTC 


ATGCAGGTCT 


TCTGGCCAGA 


GCCCCAGTGA 


GTGCCCACGG 


AGGCCCGCAG 


CACACCCAAC 


GTGGCTTGAT 


CACCTGCCTG 


TCCAGCTCTG 


GTGGGCCAAG 


AACCCACCCA 


ACAGAATAGG 


CCAGCCCATG 


CCAGCCGGCT 


TGGCCCGCTG 


CAGGCCTCAG 


GCAGGCGGGG 


CCCATGGTTG 


GTCCCTGCGG 


TGGGACCGGA 


TCTGGGCCTG 


CCTCTGAGAA 


GCCCTGAGCT 


ACCTTGGGGT 


CTGGGGTGGG 


TTTCTGGGAA 


AGTGCTTCCC 


CAGAACTTCC 


CTGGCTCCTG 


GCCTGTGAGT 


GGTGCCACAG 


GGGCACCCCA 


GCTGAGCCCC 


TCACCGGGAA 


GGAGGAGACC 


CCCGTGGGCA 


CGTGTCCACT 


TTTAATCAGG 


GGACAGGGCT 


CTCTAATAAA 


GCTGCTGGCA 


GTGCCC ( SEQ ID NO : 1 ) . 







The above-exemplified isolated DNA molecule shown in Figure 1 A-B and 
20 SEQ ED NO: 1 comprise 4946 nucleotides, with an initiating Met at nucleotides 828- 

830 and a "TAG" termination codon at nucleotides 4585-4587. The initiating Met and 

TAG termination codon are underlined. 

The present invention also relates to biologically active fragments or mutants 

of SEQ ID N0:1 which encode a mRNA molecule expressing a novel DNA helicase, 
25 NHL. Any such biologically active fragment and/or mutant will encode either a 

protein or protein fragment which at least substantially mimics the biological 

properties of the human NHL protein disclosed herein in Figure 2 and as set forth as 

SEQ E) N0:2. Any such polynucleotide includes but is not necessarily limited to 

nucleotide substitutions, deletions, additions, amino-terminal truncations and carboxy- 
30 terminal truncations such that these mutations encode mRNA which express a 

functional NHL protein in a host cell, so as to be useful for screening for agonists 

and/or antagonists of NHL activity. 

The isolated nucleic acid molecules of the present invention may include a 

deoxyribonucleic acid molecule (DNA), such as genomic DNA and complementary 
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DNA (cDNA), which may be single (coding or noncoding strand) or double stranded, 

as well as synthetic DNA, such as a synthesized, single stranded polynucleotide. The 

isolated nucleic acid molecule of the present invention may also include a ribonucleic 

acid molecule (RNA). 
5 The present invention also relates to recombinant vectors and recombinant 

hosts, both prokaryotic and eukaryotic, which contain the substantially purified 

nucleic acid molecules disclosed throughout this specification. 

The degeneracy of the genetic code is such that, for all but two amino 

acids, more than a single codon encodes a particular amino acid. This allows for 
10 the construction of synthetic DNA that encodes the NHL protein where the 

nucleotide sequence of the synthetic DNA differs significantly from the nucleotide 

sequence of SEQ ED NO: 1 but still encodes the same NHL protein as SEQ ID 

NO:2. Such synthetic DNAs are intended to be within the scope of the present 

invention. If it is desired to express such synthetic DNAs in a particular host cell 
15 or organism, the codon usage of such synthetic DNAs can be adjusted to reflect 

the codon usage of that particular host, thus leading to higher levels of expression 

of the NHL protein in the host. In other words, this redundancy in the various 

codons which code for specific amino acids is within the scope of the present 

invention. Therefore, this invention is also directed to those DNA sequences 
20 which encode RNA comprising alternative codons which code for the eventual 

translation of the identical amino acid, as shown below: 

A=Ala=Alanine: codons GCA, GCC, GCG, GCU 

C=Cys=Cysteine: codons UGC, UGU 

D=Asp=Aspartic acid: codons GAC, GAU 
25 E=GIu=Glutamic acid: codons GAA, GAG 

F=Phe=Phenylalanine: codons UUC, UUU 

G=Gly=Glycine: codons GGA, GGC, GGG, GGU 

H=His =Histidine: codons CAC, CAU 

I=Ile =IsoIeucine: codons AUA, AUG, AUU 
30 K=Lys=Lysine: codons- AAA, AAG 

L=Leu=Leucine: codons UUA, UUG, CUA, CUC, CUG, CUU 

M=Met=Methionine: codon AUG 

N=Asp=Asparagine: codons AAC, AAU 

p=Pro=Proline: codons CCA, CCC, CCG, CCU 
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Q=GIn=Glutamine: codons CAA, CAG 
R=Arg=Arginine: codons AGA, AGG, CGA, CGC, CGG, CGU 
S=Ser=Serine: codons AGC, AGU, UCA, UCC, UCG, UCU 
T=:Thn=Threonine: codons ACA, ACC, ACG, ACU 
5 V=VaI=:VaIine: codons GUA, GUC, GUG, GUU 
W=Trp=Tryptophan: codon UGG 
Y=Tyr=Tyrosine: codons UAC, UAU 

Therefore, the present invention discloses codon redundancy which naay result in 
differing DNA molecules expressing an identical protein. For purposes of this 

10 specification, a sequence bearing one or more replaced codons will be defined as a 
degenerate variation. Also included within the scope of diis invention are 
mutations eidier in the DNA sequence or the translated protein which do not 
substantially alter the ultimate physical properties of the expressed protein. For 
example, substitution of valine for leucine, arginine for lysine, or asparagine for 

15 glutamine may not cause a change in functionality of the polypeptide. 

It is known that DNA sequences coding for a peptide may be altered so as 
to code for a peptide having properties that are different than those of the naturally 
occurring peptide. Methods of altering the DNA sequences include but are not 
limited to site directed mutagenesis. Examples of altered properties include but 

20 are not limited to changes in the affinity of an enzyme for a substrate or a receptor 
for a ligand. 

The present invention also relates to recombinant vectors and recombinant 
hosts, both prokaryotic and eukaryotic, which contain the substantially purified 
nucleic acid molecules disclosed throughout this specification. The nucleic acid 

25 molecules of the present invention encoding a NHL protein, in whole or in part, can 
be linked with other DNA molecules, i.e, DNA molecules to which the NHL coding 
sequence are not naturally linked, to form "recombinant DNA molecules" which 
encode a respective NHL protein. The novel DNA sequences of the present invention 
can be inserted into vectors which comprise nucleic acids encoding NHL or a 

30 functional equivalent. These vectors may be comprised of DNA or RNA; for most 
cloning purposes DNA vectors are preferred. Typical vectors include plasmids, 
modified viruses, bacteriophage, cosmids, yeast artificial chromosomes, and other 
forms of episomal or integrated DNA that can encode a NHL protein. It is well within 
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the purview of the skilled artisan to detennine an appropriate vector for a particular 
gene transfer or other use. 

Included in the present invention are DNA sequences that hybridize to SEQ E) 
NO: 1 under stringent conditions. By way of example, and not limitation, a procedure 

5 using conditions of high stringency is as follows: Prehybridization of filters containing 
DNA is carried out for 2 hours to overnight at 65°C in buffer composed of 6X SSC, 
5X Denhardt's solution, and 100 /ig/ml denatured salmon sperm DNA. Filters are 
hybridized for 12 to 48 hrs at 65°C in prehybridization mixture containing 100 fig/ml 
denatured salmon sperm DNA and 5-20 X 10^ cpm of ^^P-Iabeled probe. Washing of 

10 filters is done at 3TC for I hr in a solution containing 2X SSG, 0.1% SDS. This is 
followed by a wash in O.IX SSC, 0.1% SDS at SO^'C for 45 min. before 
autoradiography. Other procedures using conditions of high stringency would include 
either a hybridization step carried out in 5XSSC, 5X Denhardt's solution, 50% 
formamide at 42°C for 12 to 48 hours or a washing step carried out in 0.2X SSPE, 

15 0.2% SDS at 65°C for 30 to 60 minutes. 

Reagents mentioned in the foregoing procedures for carrying out high 
stringency hybridization are well known in the art. Details of the composition of these 
reagents can be found in, e.g., Sambrook et al., 1989, Molecular Cloning: A 
Laboratory Manual', Cold Spring Harbor Laboratory, Cold Spring Harbor, New York. 

20 In addition to the foregoing, other conditions of high stringency which may be used 
are well known in the art. 

The present invention also relates to a substantially purified form of a human 
NHL protein which comprises the amino acid sequence (1219 amino acid residues) 
disclosed in Figure 2 and set forth as SEQ ID N0:2. A preferred aspect of this portion 

25 of the present invention is a NHL protein which consists of the amino acid sequence 
disclosed in Figure 2 and set forth as SEQ ID N0:2, as follows: 



MPKIVLNGVT 


VDFPFQPYKC 


QQEYMTKVLE 


CLQQKVNGIL 


ESPTGTGKTL 


CLLCTTLAWR 


EHLRDGISAR 


KIAERAQGEL 


FPDRALSSWG 


NAAAAAGDPI 


ACYTDIPKII 


YASRTHSQLT 


QVINELRNTS 


YRPKVCVLGS 


REQLCIHPEV 


KKQESNHLQI 


HLCRKKVASR 


SCHFYNNVEE 


KSLEQELASP 


ILDIEDLVKS 


GSKHRVCPYY 


LSRNLKQQAD 


IIFMPYNYLL. 


DAKSRRAHNI 


DLKGTWIFD 


EAHNVEKMCE 


ESASFDLTPH 


DLASGLDVID 


QVLEEQTKAA 


QQGEPHPEFS 


ADSPSPGLNM 


ELEDIAiCLKM 


ILLRLEGAID 


AVELPGDDSG 


VTKPGSYIFE 


LFAEAQITFQ 


TKGCILDSLD 


QIIQHLAGRA 


GVFTNTAGLQ 


KLADIIQIVF 


SVDPSEGSPG 


SPAGLGALQS 


YKVHIHPDAG 


HRRTAQRSDA 


WSTTAARKRG 


KVLSYWCFSP 


GHSMHELVRQ 


GVRSLILTSG 
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TLAPVSSFAL 


EMQIPFPVCL 


ENPHIIDKHQ 


IWVGWPRGP 


DGAQLSSAFD 


RRFSEECLSS 


LGKALGNIAR 


WPYGLLIFF 


PSYPVMEKSL 


EFWRARDLAR 


KMEALKPLFV 


EPRSKGSFSE 


TISAYYARVA 


APGSTGATFL 


AVCRGKASEG 


LDFSDTNGRG 


VIVTGLPYPP 


RMDPRWLKM 


QFLDEMKGQG 


GAGGQFLSGQ 


EWYRQQASRA 


VNQAIGRVIR 


HRQDYGAVFL 


CDHRFAFADA 


RAQLPSWVRP 


HVRVYDNFGH 


VIRDVAQFFR 


VAERTMPAPA 


PRATAPSVRG 


EDAVSEAKSP 


GPFFSTRKAK 


SLDLHVPSLK 


QRSSGSPAAG 


DPESSLCVEY 


EQEPVPARQR 


PRGLLAALEH 


SEQRAGSPGE 


EQAHSCSTIiS 


LLSEKRPAEE 


PRGGRKKIRL 


VSHPEEPVAG 


AQTDRAKLFM 


VAVKQELSQA 


NFATFTQALQ 


DYKGSDDFAA 


LAACLGPLFA 


EDPKKHNLLQ 


GFYQFVRPHH 


KQQFEEVCIQ 


LTGRGCGYRP 


EHSIPRRQRA 


QPVLDPTGRT 


APDPKLTVST 


AAAQQLDPQE 


HLNQGRPHLS 


PRPPPTGDPG 


SQPQWGSGVP 


RAGKQGQHAV 


SAYLADARRA 


LGSAGCSQLL 


AALTAYKQDD 


DLDKVLAVLA 


ALTTAKPEDF 


PLLHRFSMFV 


RPHHKQRFSQ 


TCTDLTGRPY 


PGMEPPGPQE 


ERLAVPPVLT 


HRAPQPGPSR 


SEKTGKTQSK 


ISSFLRQRPA 


GTVGAGGEDA 


GPSQSSGPPH 


GPAASEWGL* 


(SEQ ID NO; 


:2) . 







The present invention also relates to biologically active fragments and/or 



15 mutants of the human NHL protein comprising the amino acid sequence as set forth in 
SEQ ID N0:2, including but not necessarily limited to amino acid substitutions, 
deletions, additions, anaino terminal truncations and carboxy-terminal truncations such 
that these mutations provide for proteins or protein fragments of diagnostic, 
therapeutic or prophylactic use and would be useful for screening for agonists and/or 

20 an tagoni sts of NHL f unc ti on . 

Another preferred aspect of the present invention relates to a substantially 
purified, fully processed NHL protein obtained from a recombinant host cell 
containing a DNA expression vector which comprises a nucleotide sequence as set 
forth in SEQ ID NO:l and expresses the human NHL protein. It is especially 

25 preferred is that the recombinant host cell be a eukaryotic host cell, such as a 
mammalian cell line. 

As with many proteins, it is possible to modify many of the amino acids of 
NHL protein and still retain substantially the same biological activity as the wild type 
protein. Thus this invention includes modified NHL polypeptides which have amino 

30 acid deletions, additions, or substitutions but that still retain substantially the same 
biological activity as a respective, corresponding NHL. It is generally accepted that 
single amino acid substitutions do not usually alter the biological activity of a protein 
(see, e.g.. Molecular Biology of the Gene, Watson et a/., 1987, Fourth Ed., The 
Benjamin/Cummings Publishing Co., Inc., page 226; and Cunningham & Wells, 1989, 
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Science 244:1081-1085); Accordingly, the present invention includes a polypeptide 
where one amino acid substitution has been made in SEQ ID N0:2 wherein the 
polypeptide still retains substantially the same biological activity as a corresponding 
NHL protein. The present invention also includes polypeptides where two or more 

5 amino acid substitutions have been made in SEQ ID N0:2 wherein the polypeptide 
still retains substantially the same biological activity as a corresponding NHL protein. 
In particular, the present invention includes embodiments where the above-described 
substitutions are conservative substitutions. 

One skilled in the art would also recognize that polypeptides that are 

10 functional equivalents of NHL and have changes from the NHL amino acid sequence 
that are small deletions or insertions of amino acids could also be produced by 
following the same guidelines, (i.e, minimizing the differences in amino acid 
sequence between NHL and related proteins. Small deletions or insertions are 
generally in the range of about 1 to 5 amino acids). The effect of such small deletions 

15 or insertions on the biological activity of the modified NHL polypeptide can easily be 
assayed by producing the polypeptide synthetically or by making the required changes 
in DNA encoding NHL and then expressing the DNA recombinantly and assaying the 
protein produced by such recombinant expression. 

The present invention also includes truncated forms of NHL which contain the 

20 region comprising the active site of the enzyme. Such truncated proteins are useful in 
various assays described herein, for crystallization studies, and for structure-activity- 
relationship studies. 

The present invention also relates to isolated nucleic acid molecules which are 
fusion constructions expressing fusion proteins useful in assays to identify compounds 

25 which modulate wild-type NHL activity, as well as generating antibodies against 
NHL. One aspect of this portion of the invention includes, but is not limited to, 
glutathione S-transferase (GST)-NHL fusion constructs. Recombinant GST-NHL 
fusion proteins may be expressed in various expression systems, including Spodoptera 
frugiperda (Sf21) insect cells (Invitrogen) using a baculovirus expression vector 

30 (pAcG2T, Pharmingen). Another aspect involves NHL fusion constructs linked to 
various markers, including but not limited to GFP (Green fluorescent protein), the 
MYC epitope, and GST. Again, any such fusion constructs may be expressed in the 
cell line of interest and used to screen for modulators of one or more of the NHL 
proteins disclosed herein. 
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Any of a variety of procedures may be used to clone NHL. These methods 
include, but are not limited to, (1) a RACE PGR cloning technique (Frohman, et aL, 
1988, Proc, Natl, Acad. Sci USA S5: 8998-9002). 5' and/or 3' RACE may be 
performed to generate a full-length cDNA sequence. This strategy involves using 

5 gene-specific oligonucleotide primers for PCR amplification of NHL cDNA. These 
gene-specific primers are designed through identification of an expressed sequence 
tag (EST) nucleotide sequence which has been identified by searching any number of 
publicly available nucleic acid and protein databases; (2) direct functional expression 
of the NHL cDNA following the construction of a. NHL-containing cDNA library in 

10 an appropriate expression vector system; (3) screening a NHL-containing cDNA 
library constructed in a bacteriophage or plasmid shuttle vector with a labeled 
degenerate oligonucleotide probe designed from the amino acid sequence of the NHL 
protein; (4) screening a NHL-containing cDNA library constructed in a bacteriophage 
or plasmid shuttle vector with a partial cDNA encoding the NHL protein. This partial 

15 cDNA is obtained by the specific PCR amplification of NHL DNA fragments through 
the design of degenerate oligonucleotide primers from the amino acid sequence 
known for other kinases which are related to the NHL protein; (5) screening a NHL- 
containing cDNA library constructed in a bacteriophage or plasmid shuttle vector with 
a partial cDNA or oligonucleotide with homology to a mammalian NHL protein. This 

20 strategy may also involve using gene-specific oligonucleotide primers for PCR 
amplification of NHL cDNA identified as an EST as described above; or 
(6) designing 5' and 3' gene specific oligonucleotides using SEQ ED NO: 1 as a 
template so that either the full-length cDNA may be generated by known RACE 
techniques, or a portion of the coding region may be generated by these same known 

25 RACE techniques to generate and isolate a portion of the coding region to use as a 
probe to screen one of numerous types of cDN A and/or genomic libraries in order to 
isolate a full-length version of the nucleotide sequence encoding NHL. 

It is readily apparent to those skilled in the art that other types of libraries, as 
well as libraries constructed from other cell types-or species types, may be useful for 

30 isolating a NHL-encoding DNA or a NHL homologue. Other types of libraries 
include, but are not limited to, cDNA libraries derived from other cells. 

It is readily apparent to those skilled in the art that suitable cDNA libraries 
may be prepared from cells or cell lines which have NHL activity. The selection of 
cells or cell lines for use in preparing a cDNA library to isolate a cDNA encoding 
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NHL may be done by first measuring cell-associated NHL activity using any known 
assay available for such a purpose. 

Preparation of cDNA libraries can be perfonned by standard techniques well 
known in the art. Well known cDNA library construction techniques can be found for 

5 example, in Sambrook et al., 1989, Molecular Cloning: A Laboratory Manual, Cold 
Spring Harbor Laboratory, Cold Spring Harbor, New York. Complementary DNA 
libraries may also be obtained from numerous commercial sources, including but not 
limited to Clontech Laboratories, Inc. and Stratagene. 

It is also readily apparent to those skilled in the art that DNA encoding NHL 

10 may also be isolated from a suitable genomic DNA library. Construction of genomic 
DNA libraries can be performed by standard techniques well known in the art. Well 
known genomic DNA library construction techniques can be found in Sambrook, et 
al., supra. One may prepare genomic libraries, especially in PI artificial chromosome 
vectors, from which genomic clones containing the NHL gene can be isolated, using 

15 probes based upon the NHL nucleotide sequences disclosed herein. Methods of 
preparing such libraries are known in the art (loannou et al., 1994, Nature Genet. 
6:84-89). 

In order to clone a NHL gene by one of the preferred methods, the amino acid 
sequence or DNA sequence of a NHL or a homologous protein may be necessary. To 

20 accomplish this, a respective NHL protein may be purified and the partial amino acid 
sequence determined by automated sequenators. It is not necessary to determine the 
entire amino acid sequence, but the linear sequence of two regions of 6 to 8 amino 
acids can be determined for the PGR amplification of a partial NHL DNA fragment. 
Once suitable anuno acid sequences have been identified, the DNA sequences capable 

25 of encoding them are synthesized. Because the genetic code is degenerate, more than 
one codon may be used to encode a particular amino acid, and therefore, the amino 
acid sequence can be encoded by any of a set of similar DNA oligonucleotides. Only 
one member of the set will be identical to the NHL sequence but others in the set will 
be capable of hybridizing to NHL DNA even in the presence of DNA 

30 oligonucleotides with mismatches. The mismatched DNA oligonucleotides may still 
sufficiently hybridize to the NHL DNA to permit identification and isolation of NHL 
encoding DNA. Alternatively, the nucleotide sequence of a region of an expressed 
sequence may be identified by searching one or more available genomic databases. 
Gene-specific primers may be used to perform PCR amplification of a cDNA of . 
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interest from either a cDNA library or a population of cDNAs. As noted above, the 
appropriate nucleotide sequence for use in a PCR-based method may be obtained 
from SEQ ED N0:1 either for the purpose of isolating overlapping 5' and 3' RACE 
products for generation of a full-length sequence coding for NHL, or to isolate a 

5 portion of the nucleotide sequence coding for NHL for use as a probe to screen one or 
more cDNA- or genomic-based libraries to isolate a full-length sequence encoding 
NHL or NHL-like proteins. 

This invention also includes vectors containing a NHL gene, host cells 
containing the vectors, and methods of making substantially pure NHL protein 

10 comprising the steps of introducing the NHL gene into a host cell, and cultivating the 
host cell under appropriate conditions such that NHL is produced. The NHL so 
produced may be harvested from the host cells in conventional ways. Therefore, the 
present invention also relates to methods of expressing the NHL protein and 
biological equivalents disclosed herein, assays employing these gene products, 

15 recombinant host cells which comprise DNA constructs which express these proteins, 
and compounds identified through these assays which act as agonists or antagonists of 
NHL activity. 

The cloned NHL cDNA obtained through the methods described above may 
be recombinantly expressed by molecular cloning into an expression vector (such as 

20 pcDNA3.neo, pcDNAB.l, pCR2.1, pBlueBacHis2 or pLITMUS28) containing a 
suitable promoter and other appropriate transcription regulatory elements, and 
transferred into prokaryotic or eukaryotic host cells to produce recombinant NHL. 
Expression vectors are defined herein as DNA sequences that are required for the 
transcription of cloned DNA and the translation of their mRNAs in an appropriate 

25 host. Such vectors can be used to express eukaryotic DNA in a variety of hosts such 
as bacteria, blue green algae, plant cells, insect cells and. animal cells. Specifically 
designed vectors allow the shuttling of DNA between hosts such as bacteria-yeast or 
bacteria-animal cells. An appropriately constructed expression vector should contain: 
an origin of replication for autonomous replication in host cells, selectable markers, a 

30 limited number of useful restriction enzyme sites, a potential for high copy number, 
and active promoters. A promoter is defined as a DNA sequence that directs RNA 
polymerase to bind to DNA and initiate RNA synthesis. A strong promoter is one 
which causes mRNAs to be initiated at high frequency. To determine the NHL cDNA 
sequence(s) that yields optimal levels of NHL, cDNA molecules including but not 
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limited to the following can be constructed: a cDNA fragment containing the full- 
length open reading frame for NHL as well as various constructs containing portions 
of the cDNA encoding only specific domains of the protein or rearranged domains of 
the protein. All constructs can be designed to contain none, all or portions of the 5' 

5 and/or 3' untranslated region of a NHL cDNA, The expression levels and activity of 
NHL can be determined following the introduction; both singly and in combination, 
of these constructs into appropriate host cells. Following determination of the NHL 
cDNA cassette yielding optimal expression in transient assays, this NHL cDNA 
construct is transferred to a variety of expression vectors (including recombinant 

10 viruses), including but not limited to those for mammalian cells, plant cells, insect 
cells, oocytes, bacteria, and yeast cells. Techniques for such manipulations can be 
found described in Sambrook, et al., supra, are well known and available to the 
artisan of ordinary skill in the art. Therefore, another aspect of the present invention 
includes host ceils that have been engineered to contain and/or express DNA 

15 sequences encoding the NHL protein. An expression vector containing DNA 

encoding a NHL-like protein may be used for expression of NHL in a recombinant 
host cell. Such recombinant host cells can be cultured under suitable conditions to 
produce NHL or a biologically equivalent form. Expression vectors may include, but 
are not limited to, cloning vectors, modified cloning vectors, specifically designed 

20 plasmids or viruses. Commercially available mammalian expression vectors which 
may be suitable for recombinant NHL expression, include but are not limited to, 
pcDNA3.neo (Invitrogen). pcDNAS.l (Invitrogen), pCI-neo (Promega), pLITMUS28, 
pLITMUS29, pLTTMUSSS and pLITMUS39 (New England Bioloabs), pcDNAI, 
pcDNAIamp (Invitrogen), pcDNA3 (Invitrogen), pMClneo (Stratagene), pXTl 

25 (Stratagene), pSG5 (Stratagene), EBO-pSV2-neo (ATCC 37593) pBPV-l(8-2) 

(ATCC 37110), pdBPV-MMTneo(342-i2) (ATCC 37224), pRSVgpt (ATCC 37199), 
pRSVneo (ATCC 37198), pSV2-dhfr (ATCC 37146), pUCTag (ATCC 37460), and 
1ZD35 (ATCC 37565). Also, a variety of bacterial expression vectors may be used to 
express recombinant NHL in bacterial cells. Commercially available bacterial 

30 expression vectors which may be suitable for recombinant NHL expression include, 
but are not limited to pCR2.1 (Invitrogen), pETl la (Novagen), lambda gtl 1 
(Invitrogen), and pKK223-3 (Pharmacia). In addition, a variety of fungal cell 
expression vectors may be used to express recombinant NHL in fungal cells. 
Commercially available fungal cell expression vectors which may be suitable for 
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recombinant NHL expression include but are not linnited to pYES2 (Invitrogen) and 
Pichia expression vector (Invitrogen). Also, a variety of insect cell expression vectors 
may be used to express recombinant protein in insect cells. Commercially available 
insect cell expression vectors which may be suitable for recombinant expression of 

5 NHL include but are not limited to pBlueBacin and pBlueBacHis2 (Invitrogen), and 
pAcG2T (Phamiingen). 

Recombinant host cells may be prokaryotic or eukaryotic, including but not 
limited to, bacteria such as E. coli, fungal cells such as yeast, mammalian cells 
including, but not limited to, cell lines of bovine, porcine, monkey and rodent origin; 

10 and insect cells including but not limited to Drosophila and silkworm derived cell 
lines. For instance, one insect expression system utilizes Spodoptera frugiperda - 
(Sf21) insect cells (Invitrogen) in tandem with a baculovirus expression vector 
(pAcG2T, Phanningen). Also, mammalian species which may be suitable and which 
are commercially available, include but are not limited to, L cells L-M(TK") (ATCC 

15 CCL 1.3), L cells L-M (ATCC CCL 1.2), Saos-2 (ATCC HTB-85), 293 (ATCC CRL 
1573), Raji (ATCC CCL 86), CV-I (ATCC CCL 70), COS-1 (ATCC CRL 1650), 
COS-7 (ATCC CRL 1651), CHO-Kl (ATCC CCL 61), 3T3 (ATCC CCL 92), 
N1H/3T3 (ATCC CRL 1658), HeLa (ATCC CCL 2), C127I (ATCC CRL 1616),-BS- 
C-1 (ATCC CCL 26), MRC-5 (ATCC CCL 171) and CPAE (ATCC CCL 209). 

20 As disclosed in Example section 1 , a 1 15 kb BAC clone (from Genome 

Systems) was subcloned and subjected to restriction and sequence analysis. Four 
genes at chromosome location 20ql3.3 were identified, including M68/DcR3, NHL, 
SCLIP and ARP (Figure 5A). The nucleotide sequence of this BAC clone, hbml68, 
is presented as follows: 



TGAAGAGCTT 


TGACCAAGAG 


GCTGTGACGA 


GGCCCTACGA 


GGACTCTGGC 


TCTCCTCCTG 


60 


CTAAGCACAC 


CCAGGCAGGT 


GTCCTGGCAG 


ATGAGGACCA 


CATGCAGAGC 


CTCGGCCAGC 


120 


CCACCAATGC 


CCGGATATGC 


AAGTGAGCCC 


AGCCTGGACC 


CCCCGGCGAG 


GCCCAGCAGC 


180 


ACCAGCCCAG 


GCCCGAAAAC 


CTTAAGAAAT 


GACCAGTGTC 


TGCTGCTTTA 


AGCCACCAAG 


240 


CTCTGCGGTG 


GTTTGTTAGG 


CTGCAAGCAT 


GGCTAATTCA 


GAAACTGCCA 


GAAACAAGCA 


300 


CTGCTGTCCC 


CAGCCTGGGA 


CACACAGCAC 


CGCCTCTGCG 


TGGGGAGAGG 


GCACAGGCTA 


360 


AGGGCACAAA 


TGCCATCCCA 


GACCCGGCTC 


TTGTGTGTGG 


AAGGGGCCAC 


TGTGCCATGA 


420 


GGCAGAGGAA 


ACCTTGGCAG 


GACCTTATGC 


CACAGCAATT 


TAAAAGAGAA 


GAAACAGGCT 


480 


GGGCGTGGTG 


GCTCATGCCT 


ATAATCCCAG 


' CACTTTGGGA 


GGCCAAGGTG 


GTGGATCACT 


540 


TGAGGTCAGG 


AGTTCAAGAC 


CAGCCTGGCC 


AATATGGTGA 


AACCCTGTCT 


CTACGAAAAA 


600 
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TACAAAATTT AGGCAGGCGT GGTGGCGGGT 
GGCAAGAGAT TTACTTGAAC CCAGGAGGTG 
ACTGCACTCC AGCCTGTGTG ACGGAGTGAG 
ATCTGACTAG TGTGATCTCG . CAAGGAACAT 

5 TCCAAACAAG GAATCCCCAG GGGATCAAAT 
AGCCTGTAAC CCCAGTGCTT CGGGAGGCTG 
CAAGACTAGC TTGGGCAACA CAGTGAGAGC 
CAGTCCCAGC ACTGTACTAA AAATCTACAC 
AGTCCCAGCT ACTCAGGAGG CTGAGGCAGG 

10 GCCATGAGCC GTGACTGTGC CACTGCACTC 
CGAAAAACAA AAAATCATTT CATGTCTCCA 
GTAAAAAATG GTTCTGACCC ACAGAAATCT 
TTTAAAAAAG AGATGGTCTC AGAATAAAGA 
TCACACCTTA GAAAGCTTTC TGGCCGCATC 

15 ATCAGTGAGG TGCCTTCGAA CAAGCCACCT 
CCGCACGCTC AGTAGGGGAG GGGAAGTCGC 
GACGTGCCAT AACCAAGAAG CCCCAGCCAC 
GCTGGGAAGG CCCAGGAAAA GGCGGGAGGC 
AGAAATGATC TCAACGTTTT GTAACTTCCT 

20 ACCAGCACTT GGTGATGTCG CTTGCGTCAA 
CTTTCCTCTG CAACTGTGGG CTTACGGGGC 
ACAGACCGGG CAAAGCAGAG GAGAGCCACC 
TCCCTCCAGA GACCCCTGTG GCCAAGGCCA 
AGCTGACGAC AGGGGTTGGT GAGGCCGGCC 

25 GAATTAAAAA TAATAACAAC TATGTCTTGT 
TCTTTCTTTT TCTTTTTTTT TTTCGGAGAC 
GCAGTGGCGC AATCTTGGCT CACTGCAAGC 
CCTCAGCCTC CTGAGTAACT GGGACTACAG 
CAAAAGAAAA CTGACACAAT CATAAACAGA 

30 CTCTACTTCT CACTGTTCTT CAAAGAGTTA 
TAACATCCTA GAAAAAAAGC TCCTACTCAG 
CCACGTTCCA CAGTGACCAC AGGAAACCAA 
ACTGCGCCAG CCGTGCGCCT GTGTCTGCCA 
CAAGACACCC ACAGAGGAGA GCTCTAAGCC 
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GCCTGTAATC 


CCTGCTATTC 


AGGAGGCTGA 


660 


GAGGCTGCTG 


CAGTGAGCTG 


AGATCATGCC 


720 


ACTTGGTCTC 


AAAAAAAAAA 


AAGGAAACAC 


780 


TCCAGACACA 


GTGGAGCTAG 


AAGGTTCTTC 


840 


TGTTTTGCAT 


CGGCCAGACA 


TGGTGGCTCA 


900 


AGGTGGGAGG 


ACTGCTTGAG 


TCCAGGAGTT , 


960 


CCATTAGCCA 


GGCGTGGTGG 


CACATGCCTG 


1020 


GGGGCCGGGC 


ATGGTGGCAC 


ATGCCTGTAG 


1080 


ACGATTCCTT 


GAACCCAGGA 


GGTCACGGCT 


1140 


CAGTCTGTGC 


AACAGAACGA 


GACTCTGTTT 


1200 


GTTTCTCCAC 


TGGCAAAAGA 


CTCTGTCAAG 


1260 


AAGAAAGGAA 


AAAATATAAA 


AAATAGAAAA 


1320 


CCAACCTGGG 


CTATGGTTGT 


CACTCTTCCC 


1380 


TGGCCAAAGG 


GCCACCCTGC 


CCCATCTTGG 


1440 


GCCCTGGAGC 


CCGTCCTGTC 


TTGTCTGCCA 


1500 


TAGGTTTTAG 


TTCACCAGTC 


TCTGGATCAA 


1560 


ACCCAGACCC 


GATGTGGCCA 


CAAGGGGTGA 


1620 


GGACGAATGG 


AAATGTCATT 


CTGTGGCCAC 


1680 


ACCAAGAGGC 


AGTCTTAGCT 


CTGCCCTTGA 


1740 


TCAAGGCAAC 


AGAAGTGAGC 


AGGAGGCCCA 


1800 


AAAGAAGTCC 


AGGCCTCCAG 


GTGGAGGATC 


1860 


CAGCCGAGCC 


TACCTGTGCC 


TCAGACTGCC 


1920 


CCCAGACCAG 


CAGGTCCTTG 


CCAAGCTGTC 


1980 


CAGACCAGCA 


GAACCACGAA 


CCAACCAACA 


2040 


CTTAAGCCAC 


TAAGTTTTGG 


ATGGTTTCTT 


2100 


GCAGTCTCAC 


TCTGTTGCCC 


AGGCTGGAGT 


2160 


TCTGCCCCCC 


GGATTCACGC 


CATTCCCCTG 


2220 


GTGCCTGCCA 


TTGGGTGTTT 


TCTTAAACAG 


2280 


GCAAGCAAGA. 


GAACTTGGCA 


ATTATTTCCT 


2340 


ACTCAAGCAT 


AAGATGTGAG 


CAAATTCTTT 


2400 


TGTTCATAAA 


GCAAAGCTAA 


CCTACAGGAG 


2460 


GACAGCAAGT 


GGGACACCAG 


CCTCCAGGGC 


2520 


CTGCCCTGGT 


CCGTCACTGC 


CACCAGCCGG 


2580 


ACAACTGTGT 


ACGAAGACAA 


CTGTGCAGGA 


2640 
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TTTTATTACT ACAACATTTT 
TGTCACCCAG GCTGGAGTGC 
GTTCAAGCAA TTCTCCTGCT 
TGGCTAATTT TTGTACTTTT 
5 AAATTCCTGA CAAGTGATCC 
' AGCCACTGCG CCTGGCCCAT 
CACATTTCAA ATGGGTAACT 
CTCTTGGGTG TGAGAGTCTA 
AGAGCCGGGC AGGGTAAAGA 

10 TGGGCAACAC CCACTCCATC 
AGGCAAGAGA CCGAGGGCAC 
TTATACTAGA TATAGATCAT 
AGCAATTACT CTTTATTCCA 
CTAGGAAATC ATAACCTAGG 

15 GTGAGAACTG ACCAGAAGAC 
GAGGGCTCCT TGGTCTAGCG 
CCGTGGTCTA GCGGTAGCCT 
AAGAGAGTCT CCCTTTCCCC 
GAGGGCCTGA CATCAGTCAG 

20 GATGCTGTGC TTCAGTGGTC 
GCTCTGCCTT CTAGATAGCA 
GAAATAATGG CGTAAGCGGT 
AAGGGAAGGG CCCCCTGGCC 
ATGGTTCACA CTCCTTACCC 

25 ATTCGGGGCC GCTACCAGTC 
TTTTTCTTTT ATCTTTGTCT 
GAGAAAACCC ACCAACCCTG 
TAGGCACGAC TCAGCTGCTG 
GGACATCTGG CCCAGTCTCC 

30 CATGCTAATG GGCAATATGA 
TAAAAGGCAC TTTGCGGGGA 
CACCCTGTCA CCCAGGCTGC 
CTCTCAGGCT CAAGCAATCC 
TGCCACCACA CCCAAATAAT 



TGTTTTCTTT tTTTTTTTTT 
AGTGGCACAA TCTCGGCTCA 
GCAGCCTCCC AACTGGATTA 
AGTAGAGATG GGGTTTCACC 
ACCCACCCTG GCCTCCCAAA 
TTTTGTTTAT CAATAAAAAT 
CCAGTGTCCT TGATGCTCCT 
GCATGTGGGT GGTCTGGACA 
GACCCCCTAG GATGTGAAGG 
AGATGGACCC TGGTATGGGC 
GAGCTGTTCC GGTGTAATAA 
AAATATGATT ATATATGAAT 
ATATTATAAT AATCCTTGCC 
AAAAACTAGG CCATACAGAG 
AAGAGTGCGA GCCTTCTGTT 
GTAACGCCAG CATCTGGGAA 
CAGTGTCAAG GAAAAACACC 
GGGGGAGTTT AGAGAAGACT 
GCCCGCCCGC AGTTATCCGG 
ACGCTCCTAG TCCGCCTTCA 
GCAGCAAATT AGTGAAAGTA 
CTCTCTCTCT CTCTCCTCTC 
AGTGGGCACG TGACCCACAT 
TGCCGCTTTG TCTTGTATCC 
TCCGCGTCTT GGTGGTAGTG 
TGTGTCTTTA TTTCTACACT 
TGGGGCTGGT CCCTACACCC 
TAGTGAATTG CGATCCTCCA 
TCGTTGAGCC AGTTCACGAA 
GGACTAAATG GGATGGTCTC 
AAGAAACTTT TCCTTTTTTC 
AGTGCAGTGG CATGACCTTG 
TCCCGCCTCA ACCTCCCATG 
TTTTATATTT TTTGTAGAGA 



TTTGAGACTG 


AGTCTCGCTC 


2700 


CTGTAACCTC 


CATCTCCCTG 


2760 


CAGGCGCCCG 


CCACCACGCC 


2820 


ATGTTGGCCA 


GACTGGTCTC 


2880 


GTGCTGGGAT 


TACAGGTGTG 


2940 


GTACTTAATG 


TTGAACTCTC 


3000 


GCGACATGTT 


CGTGAGACTT 


3060 


GGAGGGGGAG 


GG AAGAGTGC 


3120 


CCGCCCTGCA 


TTTGTCAGAC. 


■3180 


GGCAAGCCAC 


CTAGGTGCCG 


3240 


AATGCATAAA 


ATAAGAATAG 


3300 


ATCATTCATC 


ATTAGTTTGT 


3360 


TAAGCATAAC 


CTAGGAAAAA 


3420 


ATAGGAGCTG 


AGGGGACATA 


3480 


ATGCCTGGAC 


AGGGCCACCA 


3540 


GACGCCCGTT 


GCCAAGTGGA 


3600 


CGCTACTTAG 


CAAACCAGGA 


3660 


CTACTCCTCC 


ACCTCTTGCG 


3720 


AGGCCTAACC 


GTCTCCCTGT 


3780 


TGTTCCATCC 


TGTGCACCTG 


3840 


CTGAAAGTCT 


CTGATAAGCA 


3900 


TCTCTGCCTC 


AGCTGCCAGG 


3960 


GACCTTACCT 


ATCACTGGAC 


4020 


AATAAATAGC 


GCAACCTGGC 


4080 


GTCCCCCAGG 


CCCAGCTGTC 


4140 


CTCTCATCTC 


CGCATACGAG 


4200 


TGGCTTTGTA 


GACTGGAGCC 


4260 


AACCCAGCAA 


GGCACCTGCA 


4320 


AAAGAGACTT 


TTCTGAGTGA 


4380 


CAACTTGGAC 


AAACCAACAG 


4440 


TTTTTTTTGA 


GACAGGATCT 


4500 


GCTCACTGCA 


GCCTCAACCT 


4560 


CAGCTGGGAC 


CATAGGTGCA 


4620 


CGAGGTTTCA 


CTATGTTGCT 


4680 
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CGGGCTGGTC TCAACTCCTG GGCTCAAGCA 
AGATTACAGG CAGGAGCCAC CAGGCCTGGC 
AATATTAGAA GAGATGAAAA TTCATCAACA 
AACCAGAATG GAAGCTGTGT GTACAGTGGG 
5 CAGGGCTCCA CGGTGTGGCG ACGCCCCATG 
ATGAGGACCT GGGGTGCTTT TCAACATGTA 
AAGAAAAATT CATATGTTTC TATATAAACA 
TGACAGTAGG AATCTCTAGG. TAGTAGTAAT 
TTTTCTAATT TTCTTTTTCT TTCTAAATAA 

10 ATATTTTGTA ATAAAAATGG TTATATTTAA 
AAATGGCAAA TTAGACACAC ACATGTGGGC 
GCACTTTGGG AGGCTGAGGC AGGCAGATCA 
CCAACATGGT GAAACCCCGT CTCTACTAAA 
ACACCTATAG TGCCAGCTAC TTGGGAAGCT 

15 CAGAGGTTGT AGTGAGCCGA GATCATGCCA 
ACTCCAACTC AAAAAAAAAT AAAAATAACA 
TCACAACAAT GCAGAGGGAA GAGCTTCCAA 
GTGCGCGCAG ACGCAGAGAG CAGCAAGGTG 
AGATGACTGA TGCTCAGTTT ATACCCCAAA 

20 TACCTCTGCC AGCAGGAATG AGGGAAAGGA 
ATCCAGCTAC CTGCCATCCT CAGCCCCACT 
CCCTGCCTAC AGGACCCTGA GAGCTAGGGG 
AAATTCATAT GTTGAAGGCC TCATCCCCAG 
GAGATAATTT AAATGAGGTC ATATAAGTTG 

25 GTGGTTTTTT TTTTTTTGGA GACTGGGTCT 
CACGATCACG GCTCACTGCT GTCTCCAACT 
CTCCTGAGTA GCTGGGACTA CAGGTGCTTA 
TTTTTGTAGA GATGGGGTTT TGCCATGTCG- 
ATCTGTCTCC CTCGGCCTCC TGCAGTGCTG 

30 GGACCGTGAC CTTCTAAGAA GTGAAAGAGA 
ATGAGGACAC AGCAAGAAGT CGGCCATCTG 
GAACCTGGCA GACCCTGATC TTGGACTTCA 
GCTGTTCAGG CCACGCGGTC TACGGCAGCC 
AGTCAGACCA GATCAGGAAG. AAAGGCCTAG 
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ACCCTCCCAC 


CTCAGCCTCC 


CAAAGTGCTC 


4740 


CAACATAGGA 


AGAAATTTAA 


ATTTGAATTG 


4800 


TGGAAAGACA 


AAGATCATTA 


ACTAAAGCCA 


4860 


GTCTCATGCT 


GGGAACGCGA 


GGGGCACGTG 


4920 


CTCCCTTTGT 


GGGGGTTCAT 


CCAGCGGAAC 


4980 


CGTGAGTTTA 


ATAATAAAAA 


GGTTTAAGGA 


5040 


GAACATCTGG 


AAAGATCTAT 


TCTAAGGTGT 


5100 


ATGGCCTTTT 


TGAATTTTTG 


CTTATCAGTA 


5160 


TTCTAGCTAT 


GAAATAATTT 


TCTACCATAT 


5220 


TTTTTTAAAG 


GCTGTACAAA 


CTTCCTGATA 


5280 


CGGGTACAGT 


GGCTCGCGCC 


TGTAATTCCA 


5340 


CCTAAGGTCA 


GGAGTTTGAG 


ACCAGCCTGG 


5400 


TATACAAAAA 


TGAGCTGGAT 


GTGGTGGCAC 


5460 


GAGGCAGGAA 


AATTGCTTCA 


ACCCGGGAGG 


5520 


CTGCACTCCA 


GCCTAGGCAA 


CAAGAGCGAG 


5580 


CACACGTGAA 


TAGGCTCCTC 


ATGGAAGTCA 


5640 


AGTGTAAACC 


CAGAAGCGAG 


GAGCAGGAGG 


5700 


CAGACTGAGA 


GGCGGAGGCT 


GGCCGTGGGG 


5760 


TCCGTAAATC 


TAGAGGCCTG 


GCACATCAAC 


5820 


GGGCAACCAA 


AAGATGTCCC 


ACCCTCACCC 


5880 


GGCAGAAGAC 


CCTGAGAGGT 


GGAGGCAGGC 


5940 


AAGGCGTTAT 


CCTGAACTGT 


GTCCCCCGTA 


6000 


TGTGACTGTA 


TTTAAAGATG 


GGGTCTTCAG 


6060 


GCCCTCATCC 


AGTAAGACTT 


TGACCTTCTG 


6120 


CACTCTATCA 


CTCAGGTTGG 


AGTACAGTGG 


6180 


CCTGGGCTCA 


GGTGATCCTC 


CTGCTTCAGC 


6240 


CCACCGCACC 


CAGCTGGTGG 


TGCATTGTGT 


6300 


CCCAGGCTGG 


TCCTGAACTG 


GGCTCAAGTG 


6360 


GAATTACAGG 


TATGAGCCAC 


CGCGCCTGGC 


'6420 


AAGATCTTTC 


TCTCTCCCTC 


CCTCTCCATC . 


6480 


CAAGGTAGAA 


AGCGAGTCCT 


CCCAACAGCT 


6540 


GCCTTCAGAG 


CTGTAAGAAA 


ATAACTCTCT 


6600 


CGAGCAGACT 


AAGACACACG 


CCATCTGGGG 


6660 


AGCTCAGGAT 


ACTGAAGGTC 


CCAACCCGGT 


6720 
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GCTGGACCAG 


ACCACCCCGG 


CAGCCGCGGC 


CACGGAGTCA 


CGGCTCGGGT 


GAGGTGACCT 


6780 




GGACACCATC 


CCGGCAGCCG 


CGGCCACGGA 


GTCACGGCTC 


GGGTGAGGTG 


ACCTGGACAC 


6840 




CATCCCGGCA 


GCCGCGGCCA 


CGGTGTCACG 


GCTCGGATGA 


GATGACTCGG 


ACACCACCCC 


6900 




GGCAGCCGCG 


GCCACGGTGT 


CAGGGCTCAG 


GTGAGGAGAG 


TTGGATATGG 


GACTGGGCCT 


6960 


5 


ACCCCGAGGC 


TGCTTCCACC 


CAGACGCCTG 


GGTGGGTGAC 


ACGAAAGCTG 


GGCTCAGTTG 


7020 




GGATCAGAGC 


AGCCTCTCCC 


CAGGTCAGAA 


ATGACCCTGG 


GCTCCTCACA 


GTAGCCCTAG 


7080 




GGCACCATGA 


GAAAGCTACG 


TGGACTTCTC 


TGACCAAGGG 


TCACTGCTGC 


CACACTACTC 


7140 




ATTGCAGGCC 


ATGTCAGGGC 


TCAGCTGAGG 


AGACGTGGAC 


ACCACCCCAG 


CAGCCGCGGC 


7200 




CACGGCGTCC 


CAAGGGAGGG 


ACTTGGGCAC 


TGCCTCTCTG 


GGCAAGAGTG 


GGGAGGTGTG 


7260 


10 


GGGTGGGAGA 


TGTCTGGAAA 


CATCATGGAC 


ACATGCCGGG 


AAAACACGGA 


AGCTGTGCAC 


7320 




CAAGGTGCTG 


ACAAAGGAAA 


AAGGAGAATG 


GAGGTGTGAA 


CATCCAGCTA 


GCAGGTCCCA 


7380 




CTCAGAAACT 


CCTGCATTTC 


CAGACATGGC 


CACCAGCTCT 


GTGGATGAGA 


CAGGGGAGGA 


7440 




CAGGGTACCT 


CACACCAGGA 


ACCCACACAG 


GTCCATGTCT 


TGCTCTGTGA 


TCACACAACA 


7500 




GCCTCCACCA 


CCCTGACATG 


CAGGAGGGAG 


GTCAAAGCCT 


CGGGTCCAAC 


AACAGGCTCC 


7560 


15 


ACAGCAAGGG 


AAGAAAGGCA 


GGAAGGAACT 


CAGGGCCAGG 


TCCTCCCAGG 


CAGCAGCTGC 


7620 




CTGCACGCTG 


TCCACCAAGG 


GAGGTCTGAC 


CTACACCGCA 


CAGGGGTTGG 


CAGTCTAGAG 


7680 




TCGTCCTCTG 


TCAAACGGTG 


AGAAAGTCAA 


AAGCTCATGC 


TCAGTGATAT 


GCTAGGTCAG 


7740 




CATGAAGATG 


CCACACAtGA 


GACACAGCAA GGATGAGACC 


AACGGGAAGA 


CTGCCCCAGA 


7800 




CCAGAGCCCC 


AGAGCCCTCT 


GGGGAGGAAG 


AATAAGGATG 


GCAGCCTGGG 


ACTGCCCGGG 


7860 


20 


GCTGACTCTG 


CCTTTATTTC 


ACCCCAGCAG 


AGGCAGGAGT 


GACACCGGCT 


CACAGCAGGA 


7920 




GCAGCTCTGC 


CACCTCCTAG 


CAGTTCCACC 


TACGGGCAGC 


AAAACAAAGC 


TGGCAGTTTG 


7980 




GGCAAATGTT 


AGCGTTTTTG 


CCAACTAACA 


TTTGAATCGG 


ACATCTGGTA 


CAGAGATGAG 


8040 




. GAAGAAAACA 


CTCACAGTTT 


CATGAAGACT 


GTCAAGAAAA 


TCACTGACTC 


TTCACTTCAT 


8100 




TTATGAAAGG 


CCAGCTCTCT 


GACATCCCTA 


CCACTCCCTC 


TCACATGAGA 


AATCACGGCC 


8160 


25 


TTTCAGGACG 


TGGAGCCACG 


^ TGGCCATGCA 


GGTACGGGAG 


GCCTCCCCGC 


AGCTGCAGCT 


8220 




GGGTCTTCTG 


GTCCCCGTGC 


CATTTCTGCT 


TTTCTTCGCT 


CTCTACTTAC 


ACACACATTT 


8280 




GAGTCCAGTC 


TCAGAAGAAC 


TGGAACTAGA 


AAAATCCTGA 


CACTTGTCCC 


TTACTACGTT 


8340 




AATGCCAGCT 


GTGCCAAGGA 


CAGCCCAACC 


CAAGCCCCCA 


TCAGCCCCAA 


TGGCACCGAG 


8400 




GCCCGAGCTT 


ACCCGTGAGG 


GGCCAAGTTG 


GTCGTCACCA 


ACACGGTCTT 


CACCCCGTCC 


8460 


30 


ACACCACTGC 


CGTCCACTGC 


AGTGTCCGGA 


GTTGTCACAA 


CCACCACCTC 


CTCCATGTGC 


8520 




ACACTCACGT 


CGGGAGTCGC 


CATGGCTCAG 


CGGAAGGGGA 


CGCCCAGGCC 


AGCAGCGTCA 


8530 




GTCCTCCAGG 


GTCCCAAGTC 


CTGGAGGAAG 


CAAGGGAGGG 


CACAGGGATG 


GAGTCATCTC 


8640 




CACATCCACA 


CAACATAGCA 


CTCACAAAGG 


CATCTCTAAT 


CAGCTCCAAA 


GACCCACCCT 


8700 




TGAGTCCCAG 


ACTGCTACCT CCTGACAAAA ACGAGCGGCA ACAGAAGGGC. 


TACTCCAGGC 


8760 
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TCTGGTTCCG AGGGCGGTGT 
CCCTTGGGTA GAAAGCACAG 
AGGGGGCTGG AATTCCAAGC 
GTCTCTCCTG GCCTGAACTC 
5 AAGCACTCCA AGAAGTCCTC 
AAAGAGAGGA GGGAAACGCC 
TCTCGCTCTG TCGCCCAGGC 
CCTCCTGGGT TCAAGTGATT 
CACCATGTTG CCCAGGCTGG 

10 TCCCAAAGTG CTGGGATTGC 
ATTATTATTA TTTTGTCTTT 
CGGTGATAAT GGCTGTCCTG 
AAAGGAATGG TGGCCCAAGA 
ACCTAGCACA GGGAACACAA 

15 CAGAGTGTGG GGGAGGCGAC 
. GTGAGCGCCT GGGTCGTCCC 
GAAGAAAACT GACCTACAGG 
ATATGGGACA GATCCACAGT 
GACACAGGAA CGGGGTGCAA 

20 GAAAAAAAAA AGTCTACATT 
GTAGTCAAAG CATCCAGAAC 
CAAAGGAACA AATACACAAG 
TGCTATCGGA CCAGCTCCAA 
TGGTATAGAA ACAGGAGTTA 

25 AGAAAATACA CAAAGCTGGG 
TGAGCCTAGG AGATTGAGGC 
CAACAGAGTG AGAACTCTCT 
GGGGCTGGGG GAGGGAGTAC 
ATGATGAAAA GCTCTGGAAA 

30 GAATGGTGCT GAACCACACA 
CCTGTAATCC CAGCACTTTG 
GGCCAACACA GTGAAATCCT 
CAGGCACCTG TAGTCCCAGC 
AGGCAGAGGT TGCAGTGAGC 



AAGCGCACTC CACCCGTTTT 
AGCCACTCCC TCCACGTGGG 
AACTTCCCTG GACGCAGGCT 
AAAGCCAGCC CCAATCCCTG 
GCTGGTCTGT GGAACGGGAA 
CTGGGTTTTT TTGGGTTTTT 
TGGAATGCAG TGGCACGACC 
CTCCTGCCTC AGCCTCTCCA 
TCTCAAACTC CTGACCTCAG 
AGGTGTGAGG CACCATGCCT 
CCTGGCTTTG CCTTCAGCAA 
GCCCAAAAAG CTTGGAGACA 
GTGTGGGGGC TCCCTGTTGC 
AAGCCCCTGG TTTCCAGCCA 
AAGAGGAGAG GGGAGAAGAG 
AAGACAGTCT CTACGTGGTC 
ATAGGCCACT GCCCAGGTCT 
GCACTGGAAA GTCTCTAAAA 
AATTTGCAGC CTGAACCTAA 
CTCTGTGAAA CTTAAAACAA 
ACGATCAAAC TTCCTGGCAA 
AGAAGCTGGC TCTTGAACGC 
GAGGAAAGCA AACATTGTCA 
TAACCAAACA GAAATGTGAA 
ATAGTCTCAG CTACTCGGAA 
TGCAATGAGC TGTGATCACA 
CAAAAAACGA AAAAGAAAGA 
AGGGAGTTGT TCTTTAATGA 
TGGACGGCGG TGATGGCTGC 
TTTAAAAACA GTTAAAATGG 
GGAGGCGGAT CGCCTGAGGT 
GTCTTGACTA AAAATACTAA 
TACTTGGGAG GCTGGGGCAG 
CGAGATCGTG CCACTGCACT 



TCCCACTGGA 


TAAGCCGAAA 


8820 


GCTCAGAGCA 


GGAGGACAGG 


8880 


CCCGGCTTGC 


CAGTTCTTCC 


8940 


AACTGAGTTT 


CAGGTGCAGA 


9000 


GGGAAACCCA 


TTCAAGACAG 


9060 


GGGTTTTTTT 


TGAGACGGAG 


9120 


TCGGCTCACT 


GCAAGCTCCA 


9180 


ATTGCTGGGA 


TTACAGGTTT 


9240 


GTGATCCACT 


CACCTCGGCC 


9300 


GGCCTGCCCC 


GGGTTTAAAA 


9360 


GTCCAACCCC 


TGCTAAAACC 


9420 


GGGGAATCTT 


CCTCCTGACT 


9480 


CCTCTCACTC 


TCCATCCCCT 


9540 


GAGGGCAACG 


AGCCTGGAGT 


9600 


AGGATGGCAC 


ACAGCTGTGT 


9660 


CTGACCCTAA 


AGGGCAAAGG 


9720 


CAGATGGGCC 


CCAGTGGCGC 


9780 


TAAACTGGCC 


TAAGAACACA 


9840 


CCAGGTCGAT 


TTCTTGCTAT 


9900 


GACCTAGAGT 


CCATAGCACA 


9960 


AGGGTAGTCT 


GGTTGATTCT 


10020 


AGAATCCAGA 


GACTTTCAGG 


10080 


ACCAAGTGGA 


AAGAAAATCT 


10140 


AATTAAAAAC 


GACAACCAAA 


10200 


GGCGGGGCTG 


GAGGATCGTT 


10260 






X V ^ ^ u 


AAGTAGAACA 


GAAGTGACCA 


10380 


GTACAGAATT 


TCTGTTTGGG 


10440 


ACAATCACTG 


TGGCTGTTCT 


10500 


GCTGGGCGTG 


GTGGCTCACG 


10560 


CAGGAGTTCG 


AGACCATCCT 


10620 


AAATTAGCCA 


GGCATGGTGG 


10680 


GAGACCTGCT 


TGAACCCAGG 


10740 


CCAGCCTGGG 


CAACAAGAGC 


10800 
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GAAACTCCAT 


CTCAAAAAAA 


AAAAAAAAAA 


AAAAAAAAAA 


AAGTTTAAAA 


TGGTTAAATT 


10860 




TTATGTTATG 


TATATTTTAC 


CGTAATAAAA 


ACACTGTAAT 


GCTACTATAA 


TAGAATGACT 


10920 




CATTAGGATT 


AGATATAGAC 


TAGAAAGTAC 


AGAATATAAA 


AACTTTTTAA 


AGAAAGAAAA 


10980 




ATTTTCATGG 


CCAGGCATGG 


TGTCACACCT 


GTAATCCCAG 


GACTTTGGGA 


GGCCAAGGCA 


11040 


5 


AGAGGAATGC 


TTGAGCTCAG 


GGGTTTGAGA 


CCAGCCTGGG 


CAACACAGCA 


ACACCCCATC 


11100 




TCTGCTAAAT 


AAATAATAAA 


AAATAGCCAG 


GCATGGTGGT 


GTGCACGCCT 


GTAGTTGCAG 


11160 




CTACTCTGGA 


GGCTGAGGCA 


GGAGGATCAC 


TTAAGCCCAG 


GAGGTCAAGG 


CTGCAGTGAG 


11220 




CCATGGTTGT 


GCCACTGCGC 


TCCAGCCTGG 


GCAACAGATC 


AAGACCTTGT 


CACAAAAAAA, 


11280 




AGAAAGAAAG 


AAAAGAAAAA 


AGAAAGAAAA 


TAAAATCTTC 


CAGAACTTTT 


AAAATCATCA 


11340 


10 


TTGTTAATAT 


AAAAATAACA 


TCACCTGCCC 


CTAGGACTGT 


AACAAACAAG 


TGTGTCTAAG 


11400 




GACAGGAGTG 


GGTCCACCCC 


AACCTGGCAC 


GCAGTGGTCC 


CCTGCGGAGA 


GTCTGGCCCT 


11460 




GCACTCACTA 


AGAGGAGGCA 


CTCATAGCCC 


AGCCAGGCCT 


CTGCAATTAT 


GCCTTCAATG 


11520 




CCAGAACTAA 


CTCACCCAAA 


CTGAACAATC 


GATCACAAAA 


TGTGCCTTCA 


GGTCTCAAGG 


11580 




TTCTTGCTAA 


ATCTTACTCA 


ACCGACATTT 


TCCAGCATGG 


GAACATTTTT 


CTGAATGTCT 


11640 


15 


TAGGGAGAGG 


AAGTCCGCAA 


GAGAACAAAA 


GGTCCTCAGG 


CCACCCTAGC 


TTCTTTTCCT 


11700 




CCATTCCACA 


GGCTGTCTTT 


TGTCTGGGTA 


TGCACTGGAC 


CAGGGGGCTC 


TACTTCTTCC 


11760 




TACCTGGGCA 


TGGGTCTCCA 


CACAACTCCA 


AGGTAAAGGG 


CCACAGGCAA 


GATAAAGGGG 


11820 




. AGAAAAGAAA 


GCTACGATTT 


CCTGGGCCAC 


CAATCGCAAA 


TGGCAGCCAG 


TCTCTGAAGT 


11880 




■ AACCCTTGAC 


CAGAGATCCA 


AGGAACCAAG 


AAATGTAGGT 


GATCTGAACA 


GAGGGGATGG 


11940 


20 


TGGTTAAACA 


CCATGAAGGA 


AAGACCCATT 


CTCAAAGAAA 


AGGAAGCAAA 


AAGAAACCGT 


12000 




GGGGAGCTGG 


GTACCACCCG 


CAGCAAAGAC 


CCCGCACGCG 


TTACTGACGC 


CAGCCTGGCC 


12060 




TGGGAGAGCA 


GTGAGTGTGG 


CGGACGGTGA 


GTGGCGGGGA 


GGGCTGTGGT 


AGGTTTAGGG 


12120 




TAAGAAGGGG 


CAGCGCCCAG 


AGCCCAGAGA 


ACACCAGTGA 


GGGCTCCACA 


GGAACACTAC 


12180 




TCAAAGTATT 


CACGGAACAC 


ATCTAAACAC 


AAGCACTAAG 


GACTAAGTGC 


GAGGGACAAG 


12240 


25 


AAAATATTCC 


CCGTTTCCTG 


TTTCAGGAGG 


GTATCGAAAA 


TGAGTGATGG 


AAGGAAAATG 


12300 








AAAATTTTTA 


CAAATTAAGA 


ACATCCTGGA 


ACATGATGAG 


12360 




CCGTTTACTG 


TCACTCAATT 


TAAATGGTGG 


CCATCTAGGA 


CAGAGCGCCT 


AAGGGGAAAG 


12420 




GGGGCTCACA 


GGTGAACCCC 


TCCAGCTGCT 


GGTGGGCAAT 


TTCCCATTAG 


GGCATCAGGG 


12480 




TCTCTGAAGA 


CTGTCTTCAG 


ATGCTTTTTA 


GCCAGGAAAG 


TTACAATGAT 


GAATTCGTTT 


12540 


30 


. ACACTGGCGG 


AATTACTTCG 


TATTTCTCAA 


ATATAATGTT 


TTCACTAGCA 


TAACTTTGTT 


12600 




GTTGTAGACT 


TAGGCTTCAA 


AATAAAGAAC 


TTTAAACAAA 


CATGAATAAA 


AAGCCACTTT 


12660 




AGGCCGGGCG 


CGGTGGCTCA 


CACTTGTAAT 


CCCAGCACTT 


TGGGAGGCCG 


CGGCGGGTGG 


12720 




ATCATAAGGT 


CAGAAGTTCA 


AAGACCAGCC 


TGATCAATAC 


GGTGAAACCC 


CGTCTCTACT 


12780 




AAAAATACAA 


AAATTAGCCG 


GGCGCGGTGG 


CAGGTGCCTG 


TAATCTCAGC 


TACTTGGGAG 


12840 
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GCTGAGGCAG 


GAGAATCGCT 


TGAACCTGGG 


CAGCAGAGGT 


TGCAGTGAGC 


CAAGATCATG 


12900 




CCACTGCACT 


CAAGCCTGGG 


TGACAGAGTG 


AGACTCTCTC 


TTAAAAAAAA 


AAAGCCACTT 


12960 




TAAAATTTTA 


CTCAGGCGAG 


GTGTGGTGGC 


TCACGCCCAT 


AATCCTAGCA 


CTTTGGGAGG 


13020 




CCGAGGCGAG 


CAGATCACCT 


GAGGTCAGGA 


GTTAGACCAG 


CCTGGCCAAC 


ATGGTAAAAC 


13080 


5 


CTTGTCTCTA 


CTGAAAACAC 


AAAAATTAGC 


TGGGCGTGGT 


GGTGTGCCCA 


TGTAATCCCA 


13140 




GCTACTCAGG 


AGGCTGAAGT 


GAGAGAACTG 


CTTGAACCCG 


GGAGGCAGAG 


GCTGCAGTGT 


13200 




GCCAAGACTG 


CACCACTACA 


CTTCAGCCTG 


GGCGACAGAG 


CAAGACCCTG 


TCTCAGAAAA 


13260 




AAAAAAAATT 


CAAAAATTTG 


GCCAGGCGTG 


GTGGCTCACG 


CCTGTAATCC 


CATCACTTTG 


13320 




GAAGGCCGAG 


GCGGGTGGAT 


CACCTGAGGT 


CAGGAATTCA 


AGACCAGCCT 


GGCCACCATG 


13380 


10 


ATGAAACCCT 


GTCTCTACTA 


AAAATACAAA 


AAAAAAAAAA 


CAAATTGGCC 


GGGCATGGTG 


13440 




GCGGGTGCCT 


GTAATCCCAC 


CTACTTGGGA 


GGCTGAGGCA 


GGAGAATCTC 


TCGAACTCCG 


13500 




GAGGCAGAGG 


TTGCAGCGAG 


CCAAGATTGT 


GCCACTGCAC 


TCCAGCCTAG 


ACAACAGAGC 


13560 




GAGACTCTGT 


CTCAAAAAAA 


AAAAAATTAA 


AATTAAAAAA 


TAAAAATTTC 


ATTTAAAATA 


13620 




CTACTGATCT 


CCCGTGCTGA 


CTTCTCGGGG 


TTTAACTCTC 


ACTGAGGAGA 


CGCTGCTTTC 


13680 


15 


ATAAGGGTAA 


GCTCAGCAGG 


GGCAACTAAA 


GTCATTTAAG 


CAGAGAGCTG 


CAAAGAGGCA 


13740 




ACAGCCTCAC 


TGCAGGCAGG 


GGTCCTCGTC 


ACAGCTTCAG 


GGCTTTGCAG 


AGGATTACGC. 


13800 




AATGTACACG 


CACAAAACTG 


AATTCCAGCC 


TCTCCATTGG 


CAACTGCATA 


CATACATATA 


13860 




TTCTTTTTTT 


GAGACGGAGT 


CTCGCTCTGT 


AGCCCAGGTT 


GGACTGCAGT 


GGCCCGATCT 


13920 




CGGCTCAATG 


CAAGCTCTGC 


CTCCCGGGTT 


CAAGCGATTC 


TCTTGCCTCA 


GCCTCCTGAG 


1398.0 


20 


TAGCTGGGAT 


TACAGGCGCC 


CACCACCACG 


CCCGGCTAAT 


TTTTGTATTT 


TTAGTAGAGA 


14040 




CGGGGTTTCA 


CCATGTTGGC 


CAiGGACAGTC 


TCGATCTCCT 


GACCTCGTGA 


TCCGCCCGCC 


14100 




TCTGCCTCCC 


AAAGTGCTGG 


GATTACAGGC 


GTGAGCCACT 


GAGCCTGGCC 


TCCAATGGCA 


14160 




ACTATATTAA 


AGGTTCAAAG 


CAATATGCAC 


AAAAGTTACC 


TCACAGAAAA 


. TAGTGCAAGT 


14220 




CCTTGATACA 


ATGCTCTTTA 


GACACAGAAG 


AAGCACTATA 


GAATAGAGCA 


CCTCGCCCTA 


14280 


25 


TTGCCTTCCC 


AAGGGCGAGC 


ACCCCCTCCT 


CTCTCCACAG 


CTCCTTCTTT 


GTTTTTTTGA ' 


14340 




GATGGAGTCT 


CGCTCTGTCA 


CCCAGGCTGG 


AGTGCAATGG 


CAAAATCTTG 


GCTCACTGCA 


14400 




ACCTCCGCCT 


CCCGGGTTGA 


AGTGATTCTC 


CTGCCTCAGC 


CTCCCGAGTA 


GCTGGGACTA 


14460 




CAGGCACCCA 


ACACGCCTAG 


CTAATTTTTG 


CATTTTTGGT 


AGAGACGGGG 


TTTCATCATG 


14520 




TTGGCCAGGC 


TGGTCTCGAA 


CTCCTGACCT 


CCAGTGATCC 


TCCCACCTTG 


ACCTCCCATA . 


14580 


30 


GTGCTGGGAT 


TATAGGTGTG 


AGCCACTACA 


CCTGGCCTCT 


CCACAGCCCC 


TTCTGTGTTG 


14640 




AAGCCAAGAC 


CCACCCAGCT 


TTGATCCCAA 


GGCTTGGGTT 


CCCCACTAGT 


GTGAAGTGAG 


14700 




TTTCCAAATT 


ATTAGGTAAA 


TCAGATATGA 


GAAAATATTT 


TATTTTACTT 


TTTTTTTTTT 


14760 




GAGACGCAAT 


CTTGCTCCGT 


CACCCAGGCT 


GGAGTGCAAT 


GGCACCATCT 


CCACTCACTG 


14820 




CAACCTCTGC 


CTTCTGGGTT 


CAAGCAATTC 


TCCTGCCTCA 


GCCTCCCAAC 


TAGCTGGGAT 


14880 
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TACAAGTGCA CACCACCACG CCCGGCTAAC 
CCGTGTTAGC CAGGCTGCTC TCAAACTCCT 
AAAGTGGTGG GATTACAGGT GTGAGCCATC 
AGTATTCTTG ACCGGCACGG TCAACACTGA 
5 AGCAAAGAAA GAGAATTCTG GTTCAACAGA 
GAATTACACA GTAACCAAAT AGATAACATG 
AGCTCCGACC AACAGGGTCC ACACCACCAT 
AGCCTACAGT GACGCGGGCC CCTGTGGGGT 
GTCCCAGTGG GGTGATCCCT GCGGGTCGCG 
10 CCCTGCGGGT CGGGTGCCTG CGGGGTGGTC 
CCTGCGGGGT GGCCCCTGGG AATCGCGTCC 
TGGGGATCGC GTCCCTGCGG GTCGGGTGCC 
GCGGGTCGGG TGCCTGCGGG GTGGTCCTTG 
GGGTCGCGTC CCTGTGGGGT GGCCCCTGCG 
15 CCTGCGGGGT GGTCCCTGTG GGTCGCGTCC 
TGCGGGTCGC ACCCCTGCGG CGTGGTCCCC 
AGGCCGAGCC CGCGCCCGCC CGCGGCGCCA 
CCCGCCACCG CCCGCGCCGC CCGCCTGACG 
CGGCCGCCCC TCCCCCGGCC CTCCCCTCCC 
20 ACCCCTGACT GGACGGCGGC GCGTGAGCGG 
GGCTCGCCGG CGCCGCTTAC CTGGGGCCGC 
GGCGGCGTCG GGAGCTGCGG CGGCGGCGGG 
GTTGGGGATT CGGCGGCGGG GGCGGCGCGG 
GGCCGCGCAC GCACGGGGCT GGGAGGGCCG 
25 GGCGGGGCCG TGAAATAAGG CCCGACGGGC 
AGCTCCTAAC GCCGCAGGTT CCTCCTGGTC 
CCGCGCGGGC GGCGGGGCCG AGGGACGATG 
GCCCACGCGC GGCAGGGCGG TACCTTCAGG 
GCTGCTGTGT GTGAATGGGC GCGAGGGGAC 
30 GCTGTGGAGG AGGACGCTGT AGGGTGCGCG 
CAGCCACGGC GCTCCCAGCG TGGAGGGCGA 
GTCACCCCTC AAGCTGTCAC CCCAGTCCCA 
ACAGCCGACA CCCCGATCCC ACCCCTGCCC 
TAACACCCCG GTCCCACCGC TGTCCCACGG 
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TTTTGTATTT 


TTAGTAGAGA 


CAGGGTTTCA 


14940 


GACCTCATGA 


TCCGCCCACG 


TCGGGCTCCC 


15000 


ACACCTGGCC 


CAAGAAAATA 


TTTTTAAACT 


15060 


TGTAATTGAA 


ACTGTTGTAT 


TTGAAGTGTT 


15120 


AAAGTCAGTC 


ACGACTTTTC 


AGTCACGCAT 


15180 


CCATGACTGA CGACGGGCCC ACAACAAATC 


.15240 


GGGTCTACAC 


AGATCCAGGT 


CCCGCCTGTG 


15300. 


GGTCCCTGCA 


GGTCAGGTCC 


CTGAGAGTGG 


15360 


TCCCTGCGAG 


TTGGGTGCCT 


GCCGGGTGGC 


15420 


CCTATGGGTC 


GCGTCCCTGC 


GGGTCGGGTG 


15480 


CTGCGGGTCG 


GGTGCCTGCG 


GGGTGGCCCC 


15540 


TGCGGGGTGG 


CCCCTGGGGA 


TCGCGTCCCT 


15600 


TGGGTCGCGT 


CCCTGTGGGG 


TGGTCCCTGT 


15660 


GGTCGCGTGG 


TGGCCCCTGC 


GGGTCGGGTG 


15720 


CTGCGGGTCG 


GGTGCCTGCG 


GGGTGGTCCC 


15780 


CCGGGATGGG 


TCCACCGAGG 


AGGCCGCTGG 


15840 


AGATGGAGGC 


AGGAAGCGCC 


GCCGCCCGCG 


15900 


CCGCCGTTGC 


GCCTGACGCC 


GCCGCCCGCG 


15960 


CCCGCCGTAA 


CGTCCTGACG 


CTCCGCAGGG 


16020 


AGCGAGAGGC 


CTCGCCGCGG 


GGGGGCCGCG 


16080 


GCCGGGCCTG 


CTTAGGCACC 


CGGCGGGGGC 


16140 


CGGCGGCGGG 


GGCCGCGGGC 


TTCGCTCCTT 


16200 


GCGCGCGCTT 


CCTAGTGACG 


CAGGCGGCGG 


16260 


GACACTTATT 


TGGCGCTCGC 


GGAGGAGGAA 


16320 


CCCGGGGCGC 


GTGCGCGGAC 


CGACACTGTC 


16380 


CCCGAGGCCC 


CCGGTCGGGC 


GTTGCCTGCC 


16440 


GTCAGTGGAC 


GGACGGCGGC 


AGGGAGCAGT 


16500 


CCTCCAGGTA 


CGGGCGCTCC 


TCGCCCGGAC 


16560 


TCCCCTGCGG 


GGCGGACGCC 


TGAACACGAG 


16620 


GACTCACGCG 


GAACATGCCA 


GAGGCTCAGC 


16680 


GGGGCATCCG 


GGAGCGGCCG 


GGAGGGCTCG 


16740 


CAACCAGCAC 


CCCGATCCTA 


TCGCAGTCCC 


16800 


AACAGCCGGC 


ACCCACCCCA 


ATCCCATAGC 


16860 


CCGGCACCCC 


GATCCCACCC 


CAGTCCCGCA 


16920 
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GCTGGCACCC CGATCCCACC CCAGCCCAAC 
CCCACAGCCG GCACCCCGAT CCCACCCCAG 
GGCACTCACC CCGATCGCAT AGCATAGCTG 
AGCACCCCGA TCCCACCCCA GTCCCATAGC 
5 ATCACGCCCC AGTCCTATAG CCCGCACCCC 
CATCCCACCC ATGTCCCACA GTCGGCACCC 
GATCCTGTGG CCCTCCTCCA GCCCCCAGGG 
. GGGCTGGCTC TGTTGGCCTC TGTGCTGGGT 
CTGACCTCTC CAGGTGTCAT CGAACACCCT 
10 • AGGATCCCGG ATTCCGTATC AGGGGACCGA 
GCTTGGATGG AACCCTGACC CTGTGCTCAC 
TCCCGGAACG GGAGGTTCCC AAGCCACTGC 
ATTCATTCAG TTCCCAGTAA AGTAGAACGT 
AAAATTAGAT TGGCCGAGTG CGGTGGCTCA 
15 AGGTGGGAGG ATCACCTGTG GCCAGGAGTT 
CTGGGGAAGA ATAAAAAAAA AAATTGAGCC 
AAATATGAGA GGGTTCACGC GGTCTATGTG 
GTTTTGGAAA TAAATGTCAA TAAAGTCGAA 
AGGGTGGCTA CATCATGCCT GTGTGTTGCG 
20 • CCTGGGGCCT GTTCTGCCTC CTTCACTCTG 
GAGAAACTCC ATCTAAAAAA AAAGAAAAAT 
TGGCATAACA AATTATCTCA AACTTAGTCG 
AGGCTCACGC CCATAATCCT AGCACTTTGG 
AGTTCGAGAC CAACATGGCC AGAACTGTCT 
25 GGTGGCTCAC GCCTATAATC CCAGCACTTT 
AGGAGATAGA GACCATCCTG GCTAACACTG 
AATTAGCCTG GCGTGGTGGC AGGCGCCTGT 
AGAATGGCGT GAACCCAGGA AGCGGAGCTT 
CAGCCTAGGC AACAGAGCAA GACTCCGTCT 
30 ATTCTCTAAC TGTTCTGTAG GTCGGAAGTC 
■ TGTCTGCAGG GCCGATTGCT TTTGGAGCTC 
TTCTGGAAGC ACTTGAGCCC CTTGTCTCGT 
GCCAGTTGAG TCCTCACCCT GTTGGCCCCG 
TCAAGGCCCC TTGTGATGAC ATGGGGCCAC 



AGCTGGCACC 


CACCCCGATC 


CCACCGCTGT 


16980 


TCCCGCAGCC 


GGCACCCCGA 


TCCCACAGCC 


17040 


ATACCCCGAT 


CCCACCCCAG 


TCCCATAGCC 


17100 


CAGCACCTCG 


ATCCCATAGA 


TGACACCCCG 


171'60 


GATCCCACCC 


GAGTCCCGCA 


GCCGGCACCC 


17220 


CGATCCCACT 


CGGATCCGGC 


AGCCAGCTTG 


17230 


CTCATTTATA 


TGTTTTATTG 


GCAGAGGCTG 


17340 


TTCTTCCTCT 


GCACCGCAGG 


ACTGGCTCTC 


17400 


TGTGCTTGCT 


GTCACCCGCT 


GCCTGTCTGC 


17460 


AATTAGTCGG 


AAAATAGGAA 


GCAGGTGCTC 


17520 


ACTTGTAGGA GGAGGGCTCT 


GCAGGCCGCC 


17580 


ACTTCGGAGG 


GGCTGTAATT 


AGAGTTGCAC 


17640 


GCTCCAGCCA GTGAGGAAAA GGTGTTTTTA 


17700 


TGCCTTTTAC 


CTCAACACTT 


TGGGAGACAA 


17760 


caagaccagc ctgggcaaca 


GAGCCTGTCT 


17820 


TTTGTCAGTG 


CTACTATTTT 


ATTATCTGGT 


17880 


TGTCATTTAT 


CTGAGTTTGC 


CTATCGTCAC 


17940 


GAGGAGTGCT 


GAGGGGGGCC 


TGGGGATGGG 


18000 


CAAGCCCACC 


GAGGTCGGCC 


TGGGGTGAGC 


18060 


GGGCTCCAAG 


AGACAAACTG 


GGCAACAAGA 


18120 


CACCTCCAAG 


ATAACTTAGC TTTCTTCTGC 


18180 


CTTAAAAATG 


. CAAATTTAGG 


CTGAGTGCGG 


18240 


GAGGCCAAGG 


CAGGATTGCT 


TGAGGCCAGG 


18300 


CTTTTTAAAA 


AATGCAAATG 


TGTCCGGCAC 


18360 


GTGAGGCCAA 


GGCGGGCAGA 


TCACGAGGTC 


18420 


TGAAACCCCC 


TCTCTACTAA 


AAATACAAAA 


18480 


AGTCCCAGCT 


ACTCGGGAGG 


CTGAGGCAGG 


18540 


GCAGTGAGCC 


GAGATGGCGC 


CACTGCACTC 


. 18600 


CAAAAAATAA 


ATAAATAAAA 


CTGCAAATGT 


18660 


CAGCCCAGCC 


TCACTCCGCC 


AAAATCAGGG 


18720 


CAGGGGAGAA 


GCTGTTCTGG 


CCTTTCCAGT 


18780 


GGCCTATCCC 


ACACCTGAAA 


GCCAGCCAAA 


18840 


ACACTGATCT 


CCTGCCTCCC 


TCATCTGCTG 


18900 


CAGCTGGCCC 


AGGGCACCTC 


CTGTCAGAGT 


18960 
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CCGCCGACCA GTGACCTTCA TTCCATCTGT 
GTTCACAGAT CCCAGGGGTT AGGATGTGAA 
CTTCCTTCTA AAATATTTAT CATTTTTGTT 
AGACAGAGTC TCGCTCTGTC GCCCAGGTTG 
5 AACCTCTGCC TCCGGGCAGA GGTGAGCCAC 
TTTGTTTTGT TTTTGAGATG GAGTCTCGGC 
GCACTTTGGG AGGCCGAGGC GGGCGGATCA 
AACACGGTGA AACCCCGTCT CTACTAAAAA 
CGCCTGTAGT CCCAGCTACT CGGGAGGCTG 

10 GGAGCTTGCA GTGAGCCGAG ATCGCGCCAC 
TCCGTCTCAA AAAAAAAAAA AAAAAAAAAA 
CCAGGCTGGA GTGTAGTGGC GGGATTATAG 
TATTTTTAGT AGAGACAGGG TTTTACCATG 
CAGGTAATCC ACCCGCCTCA GCCTCCCAAA 

15 ' , TCTGGCCATA TTTATTAACT ACAAAGGGAA 
CTCACTCTGT CACCCAGGCT GGAGTACAAT 
CTCeCAGGTT CAAGCGATTC TCCTGCCTCA 
CGCTACCAAG CCCAGCTAAT TTTTGTATTT 
GAGGCTGGTC TCGAACTCCT GACCTTGTGA 

20 GATTATAGGC ATGAGCCACT GCAACCGGCT 
GGGTTGGCTG GGCATGGTGG CTTATGCCTG 
GAGGATCGCT TGAGTCCAGG AGTTTGAGAC 
CCGCAAAAGC TAAAAAGTTA GCCAGGTGTG 
GGAGGCTGAG GTGGGAGGAT CACTTGAGAC 

25 TGTGCCACTG CACTCAGCCT GGGCAACAGA 
GTCCAGGCAC AGTGGCTCAC ACCTGTAATC 
TTCGTTGAGG TCAGGAGTTC AAAACGAGCC 
AAAATACAAA AAAATTAGCC AGGCATGGTG 
GACTGAGGCA GGAGAATCAT GTAAACCCAG 

30 GCCACTGCAC TTCAGCCTGG GCGACAGAGC 
TTGAGTCCTG AGTTCCTCTA AGGGAAATTC 
GAACAATCTG ATGAGGAAGA AGATAGAAAC 
AGTGGCTCAC GCCTGTAATC TGAGCACTTT 
AAGAGATCAA GACCATCCTG GCTAACATGG 
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CGCTGTAATT 


CCCCTTTGCT 


TGGAACCAAC 


19020 


TATCTTGGGC 


AGGGCTGTGG 


GGGGGCTATT 


19080 


TTGGGGATTT 


TTTTGGTTTG 


GTTTTTTTTG 


19140 


GAGTGCAATG 


GTGCAATCTC 


AGCTCACTGC 


19200 


TGCACCAGGC 


CTGTTTTTGT 


TTTTGTTTGT 


19260 


CGGGCGCGGT 


GGCTCACGCC TGTAATCCCA 


19320 


CGAGGTCAGG 


AGATCGAGAC 


CATCCTGGCT 


19380 


TACAAAAAAT 


TAGCCGGGCG 


TGGTAGCGGG 


19440 


AGGCAGGAGA 


ATGGCGTGAA- 


CCCGGGAGGC 


19500 


TGCACTCCAG 


C'CTGGGCGAC 


AGAGCGAGAC 


19560 


AAAAAAAGAG 


ATGGAGTCTC 


ACTTTGTCAC 


19620 


GTACGCGCCA 


TCATGCCCAG 


TTACTTTTTG 


19680 


TTGGTCAGAC 


TGGTCTCAAA 


CTCCTGATCT 


19740 


GTGCTGGGAT 


TACAGACGTG 


AGCCACCGTG 


19800 


AGATGATAAT 




GAGATGGAGT 


19860 


AGCGTGATCT 


TGGCTCACTG 


AAACCTCTGC 


19920 


GCCTCCCAAC 


TAGCTGGGAT 


TACAGGCGCA 


19980 


TTAGTAGAAA 


CGGAGTTTCA 


CCATGTTGGT 


20040 


TCTGCCCACC 


TCGGCCTCCC 


AAAGTGCTGG 


20100 


GAAAGATGGT 


AATTTTAAAG 


TAGAGAAACT 


20160 


TAAGCTCAGC 


ACTTTGGAAG 


TCCAAGGCAA 


20220 


CAGCCTGGAC 


AATATAGCAA 


GACCCCATCT 


20280 


GCGGCACATG 


CCTGTAGTCC 


CAGCTACTCA 


20340 


CAGGAGGTCA 


AGGCTGAAGT 


GAGCTGTTAT 


20400 


GCGAGAGTCT 


GTCTCCAAAG 


GTAAAAAAAG 


20460 


TGAGCACTTT 


GGGAGGCCGA 


GGCGGGCAGA 


20520 


TGGCTAAATG 


GTGAAACCCC 


GTCTCTACTA 


.20580 


ACGGGCGCCT 


GTAATCTCAG 


CTACTTGGGA 


20640 


GAGGCTGAGG 


TTGCAGCGAG 


CCAAGATCAT 


20700 


AAGACTGTCT 


CAAAACAAAA 


CAAAAGAATC 


20760 


CAGGCACCTC 


GCCACCCTTG 


ACAGGCAAAG 


20820 


AGCTTAAACA 


ATAGTCTCCC 


GGCCGGGGGC 


20880 


GGGAGGCCGA 


GGCGGGTGGA 


TCACAAGGTC 


20940 


TGAAACCCCG 


TGTCTACTAA 


AAATACAAAA 


21000 
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AATTAGCCGG GCGTGGTGGT GGGTGCCTGT AGTCCCAGCT ACTCGGGAGG CTGAGGCAGG 21060 

AGAATGGCGT GAACCCAGGA GGCGGAGCTT TCAGTGAGCT GAGATCGCGC CTCTGCACTC 21120 

CAGCCTGGGC :GACAGAGCCT CGAGACTCCA TCTCAAAAAA AAAAAAAAAT TAGCTGGGTG 21180 

TGGTGGCTCA CACCTGTAAT CCCAGCTACG TGGCAGGCTG AGGCAGGAGA ATCGCTTGAA 21240 

5 CCTGGGAGGC GGAGGTTGTA GGGAGCTGAG ATCGCACCAC TGCACTCCAG CCTGGGCAAC 21300 

AGAGCGAGAC TCTGTCTCAA AAAAAAAAAA AAAAAACAAA AAAACAATAG TCTCCCAAGT 21360 

AAGTCAGAGT CACAAGGTGT TTTGATTCCC TGTGGAAACT AAAATATAAC AGCTTAACAT 21420 

ATGTTCTTGA GTTATTTTTC AGAAACTTGG ACATCCACCA GGTGGAAAAT GCTGAGCTAG 21480 

- GAACAGTGGC TATAATTTCA GCCTTTTGAG AGGCCAAGGT GGAAGGATCA CTTGAGGCCA 21540 

10 GGAGTTAGAG ACCAGCCTGG CCAACATGGT GAAACCCCGT CTCTAGTAAA AATACAAATA 21600 

rrAGCTGGGC ATGGTGGTGC AACCTGAAAT CCCAGCTACT TGGGAGACCT AGCTGGGAGG 21660 

ATCGCTTGAA CCTGGTAGGA GGAGTTTGCA GTGAGCTGAA ATTGTGCCAC TGCACTCTAG 21720 

CCTGGGCAAC AGAGTGAGAC TCTGTCTCAA AAAATAAATA AATAAAAAGA GAAAAAAGTG 21780 

TTGCCTGCAG GCCGGGCACA GTGGCTCACG CCTGTAATCC CAACACTTTG GGAGGCCGAG 21840 

15 ATGGGCAGAT CACCTGAGGT CAGGAGTGCA AGAACAGCCT GGCCAACATG GTGAAACCCC 21900 

ATCTCTACTA AAAATACAAA AGTTAGCTGG GTGTGTACAT GTAGTCTCAG CTACTTGGGA 21960 

AGCTGAGGCA GGAGAATCTC TTCAACCGGG GAGGTGGAGG TTGCGATGAG CTGAGATCAC 22020 

GCCACCACAC TCCATCCAGC CTGGGTGACA GAGTGAGACT CCATCTCAAA GCAAAAAAAG 22080 

AAACATAGGT GGGACCCTTG GTGTGTCCTT AGGGCATGAT GGTTGAGGTA TACTGCTGGT 22140 

20 CCTGTCATGT AAAAGAAAAC GAGCCGACTC TGTGTCTACT GGAGAAAGCA CTGCATATAT 22200 

CAGCCACAGT CAATACCTCG CTTCTGCAGG GACGGTGGCT GCCAGAGTGG GAGGCTTTGG 22260 

TAGCACCCAT GTCGTGGAAT CACAATGTTG TCGATAGCTC TGGGGTCTTG TACAAAATGC 22320 

CAGATCCTCC CATTTGGTTT CCTTATGGGA AGGATCGCAG TACTATAATA CATGGGCTTG 22380 

TGCAAGGGAT CATTATACCC TTTTCTCTTT TTTTGCTTTT CTTTGAGACA GAGTTTCACT 22440 

25 CTCGTCACCC AGGCTGGAGT GCAATGGCGC GATCTTGGCT CACTGCAACC TCCACCTCCT 22500 

GGGTTCAAGT GATTTTCCTG GCTCAGCCTT CTGAGTAGCT GGGATTACAC ATGCCCGCCA 22560 

CCAGGCCTGA CTTATTTTTG TATTTTTAGT AGAGACAGGG TTTCACCAAG TTGGTCAGGC 22620 

TGGTCTTGAA CTCCTGACCT CAGGTGATCC ACCCACCTCG GCCTCCCAAA GTGTTGGGAT 22 680 

TTCAGGCATA AGCCACCAGG CCCAGCCTTT CTTTCTTTTT AAAATTAATC TTTGTTTAAA 22740 

30 AATACTCTCA TTTTTTATTT AATTGTAGCA CTCCTAGATC CCGAAAGCAG ATACACTCTT 22800 

GTTATGGGTC TGATTCTTTT CATTGCTTCA CGCCTTAGAG GATATTGTCC AATACTGGAT 22860 

AAAAGTTTAC TCAGGTCTAC TTCCACTTTA ACGGGGATGG CTGAATATCT CTTCCACTTG 22920 

GCTGTTTGTT TATAATGAAC TGACAAACAT ACAAATTTTC TTGAGTTCTG TGAGACATTC 22980 

TAGTAAATCA TCTAACCTGA AGAGCAGGTT GTGAGAACCC CTGATTTAGA AAGCCCAGTG 23040 
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GTCATAAATA TAAGTGGCTC TGGACTGGCT 
GGATTGAGCC CTTGTAATTT GTAGGATCTG 
TTACCTGTAT TATATTGGAC ACCCAGTTAG 
AATACCAAAT ATTTTATGTC AGGGGAGTGA 
5 TGGCTCACGC CTGTCATCCC AGCACTTTGG 
GAGATCGAGA CCATCCTGGC TAACACGGTG 
TAGCCGGGCG TGGTGGCGCG CGCCTGTAGT 
ATGGCGTGAA CCCGGGAGGC GGAGCTTGCA 
CCTGGGCGAC AGAGCGAGAC TCCGTCTCAA 
10 AAAAAACCCA TACACTTTAA GGAAAGCAAC 
GAGCTTTGAA GTAAGAATAA CAATTTTGCC 
TTTTGCCATT GTGAAAGGCT TCCCAGTACT 
ATAACTTTTT tTTTTTTTTT TTGAGATGGG 
GTGGTTCAAT CTCAGCTCAC TGCAACCTCC 
15 CAACGTCCCA AGTCGCTGGA CTACAGGTGT 
TTTAGTAGAG ATGGGGTTTC ACCATGTTGC 
CGACCCGCCC ACCTCGGCCT CCCAAAGGTG 
CGCTGCACCT GGCCTCTTGT CCTTGTGTTT 
TGCAACCAGA AAAAAAGGTT AGTGTAACAA 
20 TATGAAGATT AAAAAAATTT GAAAGTGTTT 
CATTTTGGTG ACTAGAAATA CAAGTTTATT 
CTGTCTCCCA GGCTGGGTGG CACAATCATG 
AGCGATCCTC CCACCTCAGC CTCCAGAGTA 
TGGCTAATAT TTGTATTTTT TGTAGAGATG 
25 AAACTCCTGG GCTCAAGCAA TGCTCCTGCC 
ATGAGCCACT GCACCCGGCT GAGTTTCTGT 
TTACAGCAGT CCCAGGAAAC TGAGCAACTG 
AAGGAGAAGC ACTTGTGGTT GGCACCAGCT 
CTTTGCCCGA CACGAGTGCA CAGCAGGCTG 
30 ACTTGTGCCG TATCCCCACC TGCTTTGCTG 
GCCCCAGACA CCAAGCAAGC ACCAGCTGTG 
TTTTGTGCTG CTGTAACAGA ATGCCACAGA 
TTTCCTCAAA GTTTTGGAGG CTGGGAAGTC 
GGTCTCTGCT TCCACGATGG CACCTTGACC 



CCCGGGGTCT 


GAAGTGTGGG 


CAGTCGGTTA 


23100 


ACACACACTC 


CAGGAAGGCA 


GTGTCAGAAT 


23160 


CGTTTGGAGA ATTGGTTGCT 


GGTATAGAAA 


23220 


AAGAAAAAAC 


AAAAACCCGG 


CCGGGCGCGG 


23280 


GAGGCCGAGA 


CGGGCGGATC 


ACGAGGTCAG 


23340 


AAACCCCATC 


TCTACTAAAA 


ATACAAAAAT 


23400 


CCCAGCTACT 


CGGGAGGCTG 


AGGCAGGAGA 


23460 


GTGAGCCCAG 


ATCGCGCCAC 


CGCACTCCAG 


23520 


AAAAAAAAAA 


CAAAAAAAAA 


AAACAAAAAA 


23580 


TGACAGCATT 


TGTTACCAGT 


GATAAAATTT . 


23640 


ATTGTGCCCG 


GGCCAAGAAA 


AAAAAAAGAA 


23700 


TTCTGATGAG 


CTTGACGGTG 


ATATTAACAA 


23760 


GTCTTGCTCT 


GTCACCCAGG 


CTGGAGTGCA 


23820 


GCCTCCCAGG 


TTCAAGCGAT 


TCTCCTGCCT 


23880 


GCGCCACCAC 


GTCCAGATAA 


TTTTTGTATT 


23940 


CCAGACTGGT 


CTCAAACTCG 


TGACCTCAGG 


24000 


GGAGGCCTTG 


CTGGGATTAG 


AGGTATGAGC 


24060 


TGCAGTGATG 


CAATGACCAT 


GTCTTACATT 


24120 


TGTTTATCCT 


GTTTTTCCCA 


GAGTAGACAT . . 


24180 


TGAATATAAT 


AAACTATGCT 


ATACACACAA 


24240 


GTTTGTTGTT 


TGTTGAGACA 


GGGCCCTGCT 


24300 


GCTCACTACA 


GTCTTGAACT 


CCTGGGCTTA 


24360 


GCTGGGACTG 


CAAACGAGCA 


CCACCACGCC 


24420 


GGGTTTCACC 


ATGTTGCCCA 


GACTGGTCTC 


24480 


TCGGCCTCCC 


AAAGTGCTGG 


GATCACAAGT 


24540 


TGTTTTAAGC 


CGCTTCATTT 


GTGGTACTTC 


24600 


CAGAACATCA 


AAATTGTTTT 


TCTTCAGCAA 


24660 


TTTCCTGTGC 


TCACTTCTGC 


ATGGCCGCAC 


24720 


TGGGGGAGCA 


ACTGGTTGAG 


TCAGGCCTCC 


24780 


GACACGCCTG TTTGGGGGGC 


ACCCACTGCT 


24840 


TCCAAAACTT 


ACAGTCACTG 


TCTTGGCCCG 


24900 


CTGGGTAATT 


TAATACAGAA 


" CAGAAATTTA 


24960 


CAAGAGCAAG 


GGGCCATCAG 


GTCAGGGCCT 


25020 


ACCGTGTCCT CACGTGGTCA GAGAGAGCCC 


25080 
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ACTCCCAGGA GCCCTTTTAA TAGAGCAGAA 
CGTGAACTTC GGAGGTGACA CATTCAGATC 
GTGGTTTTAA AATACGTTCA CAGACTGGCC 
CAGTTTGGGA GGCCAAGGTG GGTGGATCAC 
5 CAACATGGTG AAACCCCATC TCTACTAAAA 
TAATCCCAGC TACTCGGGAG GCCGAGGCTG 
TACAGTGAGT CGAGATCATG CCATTGCACT 
CTCAAAAAAT AAAATAAAAT AAAATACATT 
TAATCCCAGC TACTTGGGAG ACTGAGGCAG 

10 TTGCAGTGAG CTGAGATCAC ACCGCTACAC 
TCTCAAAAAA GTAAAATAAG GCCCTGCAGG 
TTTAGGAGGC CAAGGCGGTC GGATCACGAG 
TGATGAAACC CCGTCTCTAC TAGCCTAGCC 
TACAAAAATT AGCCGGGCAT GGTGGTGCGT 

15 GGCAGGAGAA TCGCTTGAAC CCAGGAAGCA 
GCTCTCTAGC CTGGGCGACA GAGCGAGACT 
AATAAAATAT GTTCACAAAT CCTTTGACAT 
CTCCTAAGCA TGAGTCTTCT CAGTGACTCA 
ACACCCAGAG GACATTGGGT TCCTCCCAAT 

20 CGCTGGCTTT GGGTCCCCCA GAGCCATGTT 
CATGTGGTAA GGAATGAAGG CCTCCTGCCT 
GGATGCCCCA CCTCCTGCCC AGCCTTCAGA 
CAACATCATG AGAGACTCCG AGCCAGAAAC 
ACTGACAGAT AATGTTCGTT GTTAATTAAG 

25 AACAGCTCTG TCTGGCCTCC CAGGAGGAGC 
CGATCAGTTT GTGAAGGAAT GTCCGCCCCC 
GTCATAAACT CCATCTGCCC TCAGTGCCTT 
TTGCGGGAGG CCAAGGCCCA CGTGTGTTTG 
ATGCCCAGGG TACAGGGATG CCATATACAA 

30 TGTCTGTATG CCCAGGATAC AGGGATGCTA 
TTTTGAGACA GAGTCTTGCT GTTTCGCCCA 
ACTGCAAGCT CCACCTCCCG GGTTCACGCC 
GAACTACAGG CGCCCGCCAC CACACCCGGC 
TTTCACCATG TTAGCCAGGA TGGTCTTGAT 
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CACTGCTGCG 


CTGCGGTTAA 


GTTTCCAACA 


25140 


ATAGCAGTCA 


CTCTAGGCAG 


AGTGTCTGAT 


25200 


GGGCACTGTA 


GCTCACGTCT 


GTAATCCCAA 


25260 


CTGAGGTCAG 


GAGTTCAAGA 


CCAGCCTCAC 


25320 


ATACAAAATT 


AGCCAGGTGG 


TGCATGCCTG 


25380 


GAGAATCGCT 


TGAATCCAGG 


AGGTGGAGGT 


25440 


CCAGCCTGGG 


CAACAAGAGC 


GAAACTCTGT 


25500 


CACAAGGCCG 


GGCACTGTGG 


CTCACGCCTG 


25560 


GAGAATCGCT 


TATAACCTGG 


GAGGTGGAGG 


25620 


TCTAGCTTGG 


GCAACAAGAG 


TGAAACTCCG 


25680 


CATGGTGGCC 


CACACCTGTA 


ATCCCAGCAC 


25740 


GTCAGGAGTT 


CGAGACCAGC 


CTGGCCAACA 


25800 


AACATGGGGA 


AACCCTGTCT 


CTACTAAAAA 


25860 


GCCTGTAATC 


CCAGCTACTC 


AGGAGGCTGA 


25920 


GAGGGTGCAG 


TGAGCCAAGA 


TTGCGCCGCT 


25980 


CCATCTCTAA 


ATAAATAAAT 


AAAATAAGAA 


26040 


TCCTCACCTC 


AAAAGCTGGA 


ACCCAACTCC 


26100 


CTTCTAACAG 


CAGAACTTAC 


ATGGTTCCCC 


26160 


ATCCCCCCAC 


CCAGCGACCC 


CCACCCAGGT 


26220 


TCAAGGACAC 


TCAGGCAGCC 


CCTGGATGTC 


26280 


GCAGCCTCGG 


GAGGGAGCAT 


TCTCAGAAGA 


26340 


TGGCCAGGAC 


CTCGTCCAAC 


GTCCTGACTG 


26400 


CCCCAGGTTT 


TGTACTCCTG 


ACTTATGGGA 


26460 


GGGTGACTTG 


TCACACACAA 


TAGGTCACTA 


26520 


CTGCCTTTCC 


TTTTCTTCAT 


GGGAAAAGTG 


26580 


ACTTGATGCC 


AGAGGCTCCA 


CATGGTGACT 


26640 


GCCAGCACCC 


GGCCTGCGAT 


CAGCTTGGTC 


26700 


TGTGTGGTGT 


CTGTGTCTGC 


GTGCCCATGC 


26760 


ATTCTTTCAA 


TGTTGTATGT 


GGCATGTGTG 


26820 


TATACAAACT 


CTGTTTTTTC 


GTTTTTTTTT 


26880 


GGCCGGACTG 


CAGTGGCGCT 


ATCTCGGCTC 


26940 


ATCCTCCTGC 


CTCAGCCTCC 


TGAGTAGCTG 


27000 


TAATTTTTTG 


TATTTTTAGT 


AGAGACGGGG 


27060 


CTCCTGACCT 


CGTGATCCAC 


CCGCCTCAGC 


27120 
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CTCCCAAAGT GCTGGGATTA CAGGCATGAG 
TTTTTTTTTT TTTTTTTTGA GATGGAGTCT 
CGATCTCAGC TCACTGCAAG CTCCACCTCC 
CCCAAGTAGC TGGGACTACA GGCACACACC 

5 GCAGAGATGG GGTTTCACCA TGTTAGGCAG 
GCCCGCCTCG GCCTCCCAAA GTGCTGGGAT 
AAACTCTTTC AATGTCTTTC TTTTCTCTCT 
TGTCTCTACG TCTTCCCCAG CTGAGTCCGA 
GAGGAGAGGT GATAGCAAGA GCTCCTTCAA 

10 AGAGGAGGAA GCTGGGTCTC TCGGGGTTGT 
TCAGCACAGA AAGTCTAGAC ATCCACTACA 
GGGCACCCCA AGTGGCTGTT TGGTGGGACA 
TTTTATTTTT TATTTTTTAT TTTTGAGAGA 
GCTCCCTACA ACCTCCGCCT CCCAGGTTCA 

15 GCTGGGATTA CAGGTGTGAG CCACCACACC 
GGAGTCTCGA GGCTCTGTCG CCCAGGCTGG 
AGCTCCGCCT CCTGGGTTCA CGCCATTCTC 
^ CAGGCATGAG CCACCGCGCC CGGCCAATTT 
TGTTGGTCAG GCTGGTCTTG AACTCCTGAC 

20 AAGTGCTGGG ATTACAGGCC TGAGCCAGCA 
ATTACTCTGT CCCCCATCCT GGGTCCAGAC 
GTGAATGGGG TGAGTGGGGT GAGTCCACAG 
AGCACCTGCC CCCAGGTCAG GAAGTCCCAC 
ATATGTAGGC CCTGCCCTTT CCTCCTGAGC 

25 GCCCCTGAGG AGGGCTCCGC CCTGAGGGAG 
AGCAGCGGGC CCTGCCACCC CCCACCCTGA 
GTGGGCTTGT GTGTAGTTTA CATGCTCATA 
TCATAATAAT GATGATAAAA TAAGATTTAT 
GCAGATCTTC ACTCCCAGAC AGGGAGCCCG 

30 ACACACAGAC AGTTCAACCA TGTCTTAAAC 
TTGAATTTTT AAGTTCACTT TACTACGTGG 
CGCTATGAGC GCTGCCACCA TGAGGAATAT 
GTCCCACAAA AGCCAGATGT GGACAGCCTT 
GGGGCTGTCT TGGTGGGGAG AGGGAGATGG 
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CCACCACGCC 


TGGCCTACAA 


ACTCTTTCTT 


27180 


CACTGTCTTC 


CAGGCTGGAG 


TGCAGTGATG 


27240 


CGGGTTCATG 


CCATTCTCCT 


GCCTCAGCCT 


27300 


ACCACGCCCA 


GCTAATTTTT 


TGTGTTTTTA 


27360 


GATGGTCTCG 


ATCTCCTGAC 


CTCGTGATCC 


27420 


TACAGGCGTG 


AGCCACTGCG 


CCCAGCCTGC 


27480 


CCTGCCATCT 


TCTCCCTTGC 


AGATTTCTTT 


27540 


GGTCCTGACT 


TGCCCACGCT 


CCCTGGACTG 


27600 


GCCCAGGAAT 


GCCACCAGGG 


CTGCCCCGGG 


27660 


GGGGACCAGA 


CACCCTTCTA 


AGACATGGAC 


27720 


AACACATCTC 


CCTCCTAACA 


GGGGGCCCCT 


27780 


GGCATGTCCA 


TCAGTCAGAA 


TATCTTTATT 


27840 


GTTTCACTGG 


AGTGCAATGG 


CACGATCTCA 


2790O 


AGCGATTCTC 


CTGCCTCAGC 


CTGCCACGTA 


27960 


CAGCTAATTT 


TTTTTTTTTT 


TTTTTGAGAT 


28020 


AGTGCAGAGG 


CGCGATCTCA 


GCTCACTGAA 


28080 


CTGCCTCAGC 


CTCCCGAGTA 


GCTGGGATTA 


28140 


TGTATTTTTA GTAGAGACAG 


GGTTTCACCA 


28200 


CTCAGGTGAT 


CCACCTCCCT 


CGGCCTCCCA 


28260 


CGCCCAGCCC 


AGAATGTCTT 


CTTACTTTTT 


28320 


CTGTGACCGT 


GAACAACCGG 


CTGCCCAGGG 


28380 


AACAGTGGGG 


TGCAGCCCCA 


■ GGGGTCTCGT 


28440 


AGCCTAGAGG 


CTCCAGCCTC 


AGATGCATAC 


28500 


GGCGGGCCAC 


AGAGTCCTGA 


ACAACAGGAA 


28560 


GGCAGGGGAG 


CCCCCGCCAG 


CCCCACCCAC 


28620 


CACCTCACCC 


CTTGGATTCC 


AGAGAGGAAA 


28680 


TCTTAAAATC 


ACCGTTGTCA 


ATAGAACAAT 


28740 


AACCAGCTTC 


AGTGTGGAGA 


TACACACAGA 


28800 


CAGCTGCCCC 


CGACCCCACA 


GGTGCAGGAC 


28860 


ACACAGGTGT 


TTATTTAATT 


GTTCATTTGA 


28920 


ATGAGATGGG 


TGCATATTAC 


AGTAGGCTTT 


28980 


CCCAGCCCTC 


AGTTCTGCTT 


CCCTTTCTGA 


29040 


GGGTTCCCAT 


CCCAGCTGGC 


TGCTCCTTCT 


29100 


GGCAGTGGGT 


CCCTGCTGAC 


CCCTGAGCCC 


29160 
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TGCAGGGGTC AGGATCCTCC 
CCGGCCAAGG CCTCCCGCAG 
TCAGCCGCCC CCTTCACACT 
CCGTGGAGAA GTGGGGGATG 
5 CAGCTGACCG TCCTCCAAGG 
. GAGGGTGTGG GCAGGAGAGG 
AAGTGGGAAG GGTGTTGGGG 
ATCTGTGGGC AGCGGTC.CCA 
GCCGCTTCCA CCAGGGCAGA 

10 GGTCTTTCAA GGACATCTGC 
GAACTCTTGC ACTTCTGAGT 
GCCTGCTCAT GGGTCCCTGG 
CAACTGCAGG CTTCCAGGAA 
CCCCTGTGCA ACCACATTAA 

15 AAGCACCGGC TGCACACGCA 
CCTCACGGGG AGCCCCTTCC 
CAGGGGCGCC GGCTCAGAGG 
TAAGGACCCG CGATCCCACG 
ACAGGGCCCC CGGGGGACCC 

20 TCACGCCCGG CGGCTCCTGC 
AGGGCGGCTG AGTGCGGAAG 
AAAAGTTGCA TCTAGACAGA 
CAGGATGGGC GCCGCCCGTC 
TCCTGCGCAC CTCGGCCGCG 

25 GGGGGTGGCT TGGGGCGACT 
CCGCCCGGCC TCATCCGGGC 
GCACCGGGGA GGCGCAGAGC 
GGCCCCGCTT CTAGCTCGTA 
ATTTAGCCAT CTATTACTGC 

30 CTCGTGCGCT GCTGGGGACG 
TCAGCGCCTC GGAGTCCCCG 
CCCCGCCCAG TGCCCCGCCC 
GGCTCCGAGC TCCGCCCTGG 
CCCGCCCCCC GCCCAGTGCC 



CGTGGTCCCT GGGTGTGGCT 
GATGGAAGTT GAGGGCCCTG 
TTACAGCAAG GGGCCAGGCA 
CTGCAGTGGT ACAAAGACAG 
CCAGCACTGG GCGTCCAAGG 
CCTGGAGTCA GGCCTCCACC 
CTGGCTTGGG AACCTTACCC 
CAAAATCCCC. CTTGGGGCTC 
GGGCACAGGA GGGGCCAGCA 
AAAGCCCTGG TGGGGAGGGG 
GGGGGACTGT CCATGCTGCC 
CAGAGAATGC CCACTCCCCA 
GGCCCAGGGT GTCCACCTCA 
GGAAAGCTGC AGCCCCCACC 
GGCTCCCAGG CACCATCACC 
CCCTGGAAAG ACAGCAGGTA 
ACCTGCCCTG ACCTGCACGT 
CCACCGCCCT GGGTTTACCA 
AGCCGCGCCC GGCCTGGTGT 
AGGGGAAGCC GTGGTCAGCG 
AGAAGCATGA AGCTGGGGGC 
GGTGAACGAA ACAAAACCAA 
CCGGGCCCTT AGCCCGACAT 
TGCAGCTCCT GCAGGAGAGG 
CCGGGAACCC CCAGGCGCGC 
TGGCCTTCGG CAGGACCCTG 
AAGGCCAGGG ACCAAGGACG 
CCGGAGCCGA GCTTCCTTCA 
GGCTAGTTAC TGTCCCGCCA 
CCCAGTAAAC ACGGGAGGAG 
GCCCCGCTCT GCGCCCCTCC 
CCTGCCTGCT GCTAGCCCTG 
CCCCGCCCCG GCCCCTGCCC 
CCGCCCCCTG CCTGCTGCTA 



CTGGAAGACA 


CTGGCAGTGC 


29220 


GCTCTGGGTC 


CTAAGAGAAC 


29280 


GCAGCTTTGG 


GATGGGGCTT 


29340 


CCTCCCCCAC 


CGCCATCCTC 


29400 


GAAAGAAGGA 


ACTCAGCCCA 


29460 


CACAGCCCCC 


TGTGGGTGCC 


29520 


GCTGCCCTTC 


CAACACCTGG 


29580 


CCTGAGGAGG 


ACTTGTGGCT 


29640 


CTCCAAAGGG 


CTCTAGGGTG 


29700 


CCTGGGCCAG 


AGGCTCTTTG 


29760 


CACAACCTCT 


AGACCATGCA 


29820 


GCAGACTCAG 


GGCAGGCCCC 


29880 


CGCCAGGTGG 


TCTCAGAGGA 


29940 


CACCCGCCTG 


CCAGTTCAAC 


30000 


CCCCTCCCCC 


GTCGCCCCTC 


30060 


CTGTAGCCTC 


GCCTGCTGGC 


30120 


GCTGACCAGA 


CAGCCCAGCG 


30180 


CGGTCACCAC 


CACCTCTCTC 


30240 


CTGCACCGAG 


GGACCGCGTC 


30300 


ACTCACCACG 


AGGACAGGGC 


30360 


GGGGGTGGGG 


GAGGAGGAAC 


30420 


AACCCGAACG 


TGTTCCGTCG 


30480 


CTCTTCTCGC 


TGCTCCTTGT 


30540 


GGGCGGGAGG 


GCCTGAGGGC 


30600 


AGGCCGTGGC 


GCCCTGGCAC 


30660 


ACTGAGTTGA 


GGGGGCGGGA 


30720 


GGTTTCCTGG 


GAGCTGGCTG 


30780 


GGGCACTTTC 


AATATAATGA 


30840 


GGACCAGACT 


CTGGACCTGC 


30900 


CCCCCGACCC 


CCACCCCAGC 


30960 


GAGCTCCGC.C 


CTAGCCCCGC 


31020 


CCCCCGCCCC 


GGCCCCTGCC 


31080 


GCTCCGAGCT 


CCGCCCTGGC 


31140 


GCCCTGCCCC 


CGCCCCGGCC 


31200 
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CCTGCCCGCT 


CCGAGCTCCG 


CCCCGGCCCC 


GGCCCGGCCC 


CTGCCCGCTC 


CGAGCTCCGC 


31260 




CCTGGCCCCG 


CCCCCGCCCA' 


GTGCCCCGCC 


CCCTGACTGC 


TGCTAGCCCT 


GGCCCGGCCC 


31320 




CGGCCCCTGC 


CCGCTCCGAG 


CTCCGCCCCG 


GCCCCGCCCC 


GGGCGCTGCC 


CGCTCCGAGC 


31380 




TCCGCCCCGG 


CCCCGCCCCG 


GCCCCTGCCC 


GCTCCGAGCT 


TCGCCCCGGC 


CCCGCCCCGG 


31440 


5 


CCCCTGCCCG 


CTCCGAGCTC 


CGCCCCGGCC 


CCGCCCCCGC 


ACCTTCTCGC 


GCAGCCGCTC 


31500 




GCGCAGTGCG 


GCCAGGTGTG 


CCTCGCGGAT 


CTCCTTGCTG 


AGCTCCATCT 


TGTAGTTGAG 


31560 




CTTCTCCTCC 


GCCTGGCGGC 


TGAAGTTGTT 


ATTCTCCTCC 


AGCGCCTTGT 


GCAGCACCTC 


31620 




GCGCTCGTGC 


TCGCGCCGCT 


CCGCCAGCTG 


CTTCAGCACC 


TGCGCCTCCT 


GCGTCTGTGC 


31680 




GGGGCCGGCG 


GGCGCGCGTG 


AGCGGCAACC 


CCGGGCCCTG 


CCCGGCCGGA 


CTCCTCCCTG 


31740 


10 


CTCTCCGCCT 


CCCGCCCAGC 


GCCCGCTCGC 


CTCACCTGGC . 


GCCTCCACCT 


GCCCAGGCCT 


31800 




CGGTGGGCGC 


CGGGACCCCC 


GGGCGCTGCC 


CTGGGAACCC 


TCGCCTGCCA 


TCCGGCCTGT 


31860 




GGTCGGGGCA 


GGGGGAGGGG 


GTCGCGATCC 


GCCGCCCCCG 


CCCCCGTCCC 


TGCCTCGCGC 


31920 




GCGGGTCCCG 


CGGTCCTGGC 


TGC.GCCCAGG 


GCCCCCGCCA 


TACCCTGCCG 


CCACTGCACA 


31980 




CCCTGCCCTG 


CGCGTCTGCC 


CCTCCAAGGA 


CCAGCAGCAA 


GAAACCCTAA 


ACTTGTGGGC 


32040 


15 


GGTCTCTGAG 


CTTTGTCTCT 


TCCTCGGACA 


TCCGCCCACT 


GAGCAGAGTA 


GCTGCTTGTT 


32100 




ACACACCGGG 


TTCCCAGCTC 


CCAATTAGGT 


GCCCAGGAGC 


GGAGGGTCCC 


CAGGGATGCT 


32160 




GGGGGAGGGG 


CGGGCTGGTG 


ACCCCTGGGA 


GGAGAGCGGG 


GCAGCAGGAC 


CCGCACCCAC 


32220 




ATGCCAGTCC 


CTACTAGTCA 


GCCCTGTGAA 


CCCTGGTCTC 


TGGCCTCACC 


GGGAAGGGAA 


32280 




CGGAGCCGCT 


TCCCCTGCCC 


AATGCGTTGG 


CCTCCAGGGT 


GGCACCCCCA 


AAAGGACATT 


32340 


20 


TTTATCTCTG 


TTTCAGTCTC 


AGAGGGGCTG 


GTGGGAGGGG 


AGGCTGCAGG 


GAGGGGACCT 


32400 




GGAGCCCACA 


CCCACCTCTC 


CCAGGGCCCC 


TCCGCCCTCC 


AGCAAGCCTC 


AGGGTCTTCA 


32460 




CACATGAGGC 


CCTTCCTCCA 


GCTTCCCTGT 


CTGGGAGAGG 


GATGCCCCAC 


CCGACGTCCC 


32520 




CAGGGCCCAT 


CTGGGGACCA 


CCCCCTAGCA 


TCCTGCTGGC 


CCTGACAAGG 


GTGCCTCCCA 


32580 




CCCTCACCAG 


AGGCTCCTGC 


TCCTTCCAGG 


TGGCCGCCTC 


GGAACCCTTC 


CTCCTCTCCA 


32640 


25 


TCCCTTTCTT 


TTTTTGTTCT 


TGTTTGTTTT 


TTGAAATGGA 


GTCTCACCCT 


GTCGCCCGGG 


32700 




CTGAGGAGTG 


CAGTGGCGCA 


GTCTCGGCTC 


ACTGCATCCT 


CCACTTCTTG 


GGTTCAAGCA 


32760 




ATTCCCCTGC 


CTCAGACTCC 


CTAGTAGGTG 


GGATTACAGG 


TGTGCACCAC 


CACACCTGGC 


32820 




TAATTTTGTA 


TTTTTAGTAC 


AGATGGGGTT 


TCACCATGTT 


GGCCAGGCTG 


ATCTTGAACT 


32880 




TCCAACCTCA 


AGTGATCTGC 


CTGCCTCAGC 


TTCCCAAAGT 


TCTGGGATTA 


CAGGCGTGAG 


32940 


30 


CCACCACACC 


CGGCCTCTCC 


CCATCCCATT 


CTTATCTCTC 


AGAAAGAGGC 


CCAGGGAGCC 


33000 




ACAGCCCCTC 


CTGCTCCAGG 


CCAAGGCACT 


GACCAAGCCT 


GTCCGGGAGC 


ACCCTGCTTC 


33060 




TTGCAGGCCC 


TGTCCCCGTG 


GGCCGCCTCC 


GTTGAAACTC 


CTGGGGGGTG 


GGGGATGGAG 


33120 




GACTCCTTGC 


CTTCCTCCGC 


TCCTCGGCTG 


CCTCCAGCCG 


CTTTTGCAGC 


TCCTCCAGGG 


33180 




AGGTGTCCTT 


CTTCTTGGGT 


GGGGAGGAGA 


GCATAGGGCT 


CTCTGGGGAC 


AGGTCAGAAG 


33240 
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GGGACTTGAG GATGACCTCG AAGCTCTGGC 
CCATGTCTGC AGGGCAAGAC CAGAGTAGAG 
GCTGAGCGGG ATGCTGCCAG CACACATCCA 
CTCCCGCAGG AATCCCAGGC TTCAGCCCCC 
5 TCTCCAGAGG CAGCCAGGAC GGGAGTTCAG 
AGGTGGGCAG GCACCGCAGG GAAGGGCAGG 
AGACGGTATT GGGGTGCGGC TGTGTGTAGA 
GCTCCTTCAT CTTCTCCTTG TAGGCTGTGG 
GGGCCTGCCC ACCTGGGCCC CCGTTTTCCC 

iO GAGACACGGA GCTGCCCAGC ACGCTCTCTT 
TCTCCAGCTC TCTCGCTTCC AGACGTCGGC 
GTCTCTCTCG CCCTCTGCCT CTCCCCGCTT 
ATCTTCCACA CTGACTTGTT. TCTGCATCCT 
ATTACCCAAA ACTCATGTGT TGAAATCGTA 

15 CATTCGGAGG GAATTAGGCC ATGAGGATAG 
TCAGAGAACT CCCTCACCTT CCATCATGTG 
TGTACCAGGA GGCGAGACCT CTCCAGGCAC 
TCTGGCCTCC AGAGCGATGG GAAATAAGTT 
TACCTGCCCA GACTGACAAA GTGGCTACCC 

20 TGTGTCTGAC TCTGTCACTG TCATTGTATC 
CTGTCTTTCT CCCTGAGTGC ATCTTTCTGT 
CTTACCTCCC TCTGTCCCGC TACTTCTCTC 
GCTCAAGCAG GCAAGATTTA CTCATGACGG 
AGGACTTTCT TGGGCTGTAA ACCTGGATGA 

25 TGATTGTGTG GTCACCTTGA GATGAAACCA 
TCTAGGGCTG GGCTTGGGCC ACTGATGCCA 
CAGCACCACA GCCAGCCCTC TTGCCCATTC 
CCTGCTGCCC AACCCAAACC AGAAGCAAGC 
GGAAGAGGAA GGCGCCTGCT GTCACAGTGA 

30 GCGAACCTCA TTCCTAAGCA GACACGCTGG 
TTTGGCTGGG TGCAGTGGCT CATTTAACTG 
GTGGAACACC TGAGGTCAGG AGTTTGAGAC 
CTACTAAAAA TATAAAAATT AGCCAGGCGT 
GGGAGGCTGA GGCAGGAGAA TCGCTTGAAC 
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CTGAGGCCCG 


CTTGTCCAGC 


TGCTTCACCT 


33300 


CTTCAGAGGC 


CCGGCCAGGG 


CATGGCGTGG 


33360 


ACCCCAGGGC 


TGGGCGAGAG 


GGGGTGGCTG 


33420 


AGGATGGGCC 


CCTTCCCCCT 


AGAACCTCCC 


33480 


AGAGACTGCC 


GGAGGCCGGG 


GGAAAAGGTG 


33540 


CGGCAGCCAG 


GCACTCACCC 


CCGTACTGGT 


33600 


AGCAGGAGCA 


GATGAGCGAC 


AGCACCGACA 


33660 


GCACAAGGCT 


GGGCTGAGCA 


AGCACCACTG 


33720 


TCCCCATGGC 


TGCCTCTATC 


ATGTCTCTGT 


33780 


GTGTGTCTCC 


ACACCGCCGG 


CCCCTTCGTC 


33840 


ACTGTCTCCG 


TGGTGTGTCC 


CCTGCCTTCT 


33900 


TTCCTCTCTC 


TCGGCATTAA 


TGTCTGTCTC 


33960 


TCTCCTGCCT 


GCTGTGGTCT 


GAATGTTTCC 


34020 


ACCCCAAGGT 


GCCGGTGTGC 


GGAGGTGAGG 


34080 


AGCCCTCCTA 


AGTGGCCCCA 


GAGTGGGGCT 


34140 


AGGACACAGC 


CAGAAGACGC 


CACCCGTCTA 


34200 


CGACTCTGCC 


GGCACCTTGA 


TCCTGGACTT 


34260 


CCTGTCGTCT 


ATAAACCACT 


CAGTCTCAGG 


34320 


CTGCCTGTCT 


GGGTCTCTGT 


TTACCTTCTG 


34380 


TTTCTGTGTC 


TCTGGGGGTA 


GCCCCTGACT 


34440 


GATTCCTTGT 


CACTGTGTGT 


CTTTCTGACT 


34500 


TCCCCTCCTC 


CTCCTTCCCA 


CTCCTCGCCA 


34560 


GACCAGCACA 


GATGCAAACC 


CTCTGTGGGC 


34620 


AGCCCTCAGA 


CCCTCCTTTT 


TCCTTCCCAA 


34680 


GGCCCTCTCC 


AGGCACATGC 


TCTCTGTCTA 


34740 


CCAAGGAGCA 


AGGGAGGGAA 


GCTGTCCGTT 


34800 


AGGTCAATCA 


AGTGCCCACC 


AGCCAGTGTC 


34860 


CGGGCTCCTG 


TGGCCCTGTG 


CCCTGTCAGG 


34920 


AAATAATTTA 


GCTCTTTTGG 


TCTATTCAGG 


34980 


CCCGGTTTCT 


CACTAGTGCT 


CGATAATCCT 


35040 


TAATCCCAGC 


ACTTTGGGAG 


GCCAAGGCAG 


35100 


CAGCCTGACC 


AACATGGTGA. 


AACCCGATCT 


35160 


GGTGGCAGGC 


ACCTGTAATC 


CTAGCTACTT 


35220 


CTGGGAGGCG 


GAGGTTGCAG 


TGAGCCGAGG 


35280 
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TCGCGCCATC 


GCACTCCAGC 


CTGGGTGACA 


GTGTGAGACT 


CCGTCTCAAA 


ACAGAAAGAA 


35340 




AAAGAGAGAG 


AGGAAGAAAG 


GAAGGAGGGA 


GGGAGGGAGG 


AAAAGAAGAA 


AGGAAAGGAA 


35400 




AGGAAGACAG 


ACAAGGCAGA 


AGTAATCAAG 


CCTTTCATGG 


TGAGCTGGGT 


CTTCTGGTGA 


35460 




CAGTGCAGAG 


AATGGTCTGT 


CCTGACTTAA 


ATTTCCTGGT 


GACCTACACT 


TTTCTGGACA 


35520 


5 


GAGCAGCACA 


GAGCCGAAGA 


GGGTGTAAGG 


AGGAGCAGAA 


AGGAATCCCA 


GGGTGGGCAG 


35580 




GCCCGTGCGA 


GAGCCTTTGG 


GGGAAGGAAT 


GAGACTTTGA 


GCCGGGAAGC 


GAGGCAAAGC 


35640 




TACCTGTCTT 


GGTCATTGTC 


TTCAGGGAGG 


GAGATGGAGG 


GGGACCAGGT 


GGGGGAGCCT 


35700 




CACAGGGGAC 


TTTGGTCTGA 


CTTGTCAAGT 


TTTCTTTTTT 


TCTTTTTGAG 


ATGGAGTCTT 


35760 




GCACTGTTGC 


CCAGGCTGCA 


GTGCAGTGGT 


GCGATCTCGG 


CTCACCGCAA 


GCTCCGCCTC 


35820 


10 


CTGGGTTCAC 


ACCATTCTCC 


TGCCTCAGCC 


TCCCGAGTAG 


CTGGGACCAC 


AGGCACCGCC 


35880 




ACCACACCCA 


GCTAATTTTT 


TGTATTTTTA 


GTAGAGACGG 


GGTTTCACTA 


TATTAGCCAG 


35940 




GATAGTCTCG 


ATCTCCTGAC 


CTCGTGATCC 


GCCCGCCTCG 


ACCTCCCAAA 


GTGCTGGGAT 


36000 




TACAGGTGTG 


AGCCACTGTG 


CCTGGCCTAC 


TTTATTTTTT 


AGAAACAGGA 


CTGTGCTCTG 


36060 




TTGCCCATGC 


TGGAGTGTAG 


GGTGCAGCTG 


TGCGGTTCAC 


TGCAGCCTTG 


AACTTCTGGG 


36120 


15 


CTTGACGGAT 


CCTGCCATCT 


TAGCAGCTGG 


GACTACAGGT 


GCATGCCAGC 


ACACCAGTTT 


36180 




TCTTTTTTTT 


TTTATCTCTG 


CTCACTGCAA 


TTCCGCCTCC 


TGGGTTCTAG 


CGATTCTCCT 


36240 




GCCTCAGCCT 


CCCAAGTAGC 


AGGGATTACA 


CGCACATGCC 


ACCACACCCG 


GCTAATTTTT 


36300 




GTATTTTTAG 


TAGAGACAGG 


GTTTCACTAT 


GTTGGTCAGG 


CTGGTCTTGA 


GCCACCGCGC 


36360 




CCGCCCGGCC 


TACACACCAG 


CTTAAAAAAA 


AGAAAAAAAT 


AGCTGGGCGT 


GGTGGCTCAT 


36420 


20 


GCCTGTAATC 


CCAGCACTTT 


GGGAGGCTGA 


GGCAGGCAGA 


TCACCTGAGG 


TCAGGAGTTC 


36480 




AAGACCAACC 


TGGCCAACAT 


GGCGAAACCC 


TGTCTCTACT 


ACAAATATAA 


AAATCAGCCA 


36540 




GGCGTGGTGG 


CGGGCTCCTC 


TAATTCCAGC 


TACTTGGGAG 


GCTGAGGCAG 


GAGAATCACT 


36600 




TGAACCCGGG 


AGGTGGAGGT 


TGAAGTGAGC 


CAAGATCGAG 


CTACTGCACT 


CCAGCCTGGG 


36660 




AGCAAGACTC 


CCGTCTCAAA 


AAAAAAAAAA 


AAATTTGTAG 


TGGTATGGAG 


GCCGGGCATG 


36720 


25 


GTGGCTCACG 


CCTGTAATCC 


CAGAACTTTG 


AGGGGCCAAG 


GCGGGCAGAT 


CATGAGGTCA 


36780 




GGAGTTCGAG 


ACCAGCCTGA 


CCAACATGAT 


GAAACCCTGT 


CTCTACTAAA 


AATAACAAAA 


36840 




ATTAGCCAGG 


CATGGTGGCG 


GGCACGTGTA 


GTCCCAGCTA 


CTCGGGAGAC 


TGAGACGGGA 


36900 




GAATCGCTTG 


AACCCAGGAG 


GCAGAGGTTG 


CAGTGAGCTG 


AGATCACGCC 


ACTGCACTCC 


36960 




AGCCTGGGTG 


ACAGAGTGAG 


ACTCTGTCTC 


AAAAACAAAC 


ACAAACAAAC 


ATATATATAT 


37020 


30 


ATACATGTAT 


ATATATAATA 


TATATATACG 


TATATATACA 


CGTGTATATA 


TATAATATAT 


37080 




ATACGTATAT 


ATACACGTGT 


ATATATAATA 


TATATACGTA 


TATATGTATA 


TATTAATATA 


37140 




TATACGTATA 


TATACACGTG 


TATATATTAA 


TATATATACG 


TATATATACA 


CGTGTGTATA. 


37200 




TATTAATATA 


TATACGTATA 


TATGTGTGTG 


TGTGTATATA 


TATATGTATA 


TATATATATA 


37260 




TATATACATA 


TATATATACA 


GAGAGAGAGA 


GAGTAGTGAT 


AGGTCTTGCT 


GTCTTGTCCA 


37320 
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GGCTGATCTT 


GAACTCCCGG 


CCTCAAGAGA 


CCCTCCCACC 


TCAGCCTCCC 


AAAGCACTAG 


37380 




GATTATAGGT 


GTAAGCCACA . 


GTACCTAGCC 


TATTAAAAAT 


TAATGTTAAA 


CAAGAGGATG 


37440 




TGATGAGGGA 


GTTAGAGGGT 


GTGCCAGCCA 


TGTGTTCCAC 


AGCAGCAGGT 


CAGGAGACAT 


37500 




TGGGGACATT 


TAGAGGAGCT 


GAAGAGGTGG 


CCAACCCTGT 


GCTCAGGAGG 


ACGGGGGAGG 


37560 


5 


GAGAGAGCAA 


GAGGGAGTTT 


GGGCTGGGGC 


AGAACGTACC 


TGGGTCCTGA 


GAGGATAAGA 


37620 




AGGTAGGGAC 


TTGGCCCCTC 


CAGGCCTGAC 


TCTGCCAGCA 


ACCAGCTCCC 


TATCAGCAGA 


37680 




CTCCAGGCCC 


CTACCCTTCA 


GCTCATCCTT 


CCTTATCACA 


CATCCAAAAC 


TCTGAATGTG 


37740 




GCCGGGCGCA 


GTGGCTCACG 


CCTGTAATCC 


CAGAACTTTG 


GGAGGCTGAG 


GCAGGAGGAT 


37800 




CGCTTGAGAA 


CAAGAGTTTG 


AGACCAGCCT 


AGGCAACATG 


GTGAAAGCCC 


ATCTCTACTA 


37860 


10 


AAAATATAAA 


AATTAGCTGG 


GTGTGGTGGC 


ACATGCCTGT 


TGCCCCAGCT 


ACTCAGGAGG 


37920 




CTGAGGCAGG 


AGAATCACTT 


GAGCCTGGAA 


GGCGGAAGTT 


GTAGTGAGCA 


GAGATTGTGC 


37980 




CACTGCGTTC 


CAGCCTGGGC 


AACACAGCGA 


GACTCTGTCT 


CAAAAAACAA 


AAACTGGAAT 


38040 




GTGTTTACCA 


TAAAGGCCAG 


AAAATGTGAT 


TAACAGCTGC 


TCAAAGCCCC 


TGTCTGCCCT 


38100 




AAGCCTGAAA TTTTCACCGA AAAAAAGATC TGTAGGCTCA 


TACAGAGGAA 


GGACAAACAC 


38160 


15 


CAGGGAGGCT 


CTCTTCCAGT 


TTGCTTCACC 


TCAGCAAGCA. 


GACGGCTGGC 


AGCAATTTGG 


38220 




GGGCAGGTGT GAGCACCTGC- ATCATCAGGA AAGAAGGGGC 


ACGGTGGGGA 


CGCAGGTCAG 


38280 




ACCTCTCACA 


GGTCTTGGCT 


CTGCCCAGGA 


GACACGTGTC 


CAACTGAGAG GTGAGGAACT 


38340 




GGGTTCTGCA GCTGCAGACA CAGGTGCGGC TCAGCATCTG 


ATGGCCACGG 


AGACCCCCTG 


38400 




GCTTGGCTTC 


TCCCAGCTGG 


TGGCCCATGA 


GGAGCTTCTA 


TCCCAAGAGA 


CTGTCCCTCA 


38460 


20 


AGGAGCAAGT GGGACCAGGT ACCCACAGGA CGGAGCCTGG 


GAGTGAGGCC 


TGCCCTGTGG 


38520 




TCTGGCTACA GGGAGGAAGG 


GCAGATTGGA 


GGGGGCAGGA 


CAGCAGGTCA 


GGAATTGGCC 


38580 




AACTCTGGAG 


AGAGCAAGCA 


AGGGGAAGTC 


TGCGCACAGG 


GCAGGGCTGG 


TCAGGGGCGA 


38640 




GGCAGGGCAT TGGACCAGTA TTTTCAGAGC TGGTGAGGCT 


TAAAGAGCAT 


GTCTACTGCC 


38700 




TCTTATTACA GAGAGAGGAT 


GCCGAGGCCC 


AGACCCATCC 


AGGCCACCTC 


TCCACAGACA 


38760 


25 


CAGCTGGTGC 


CAGGGAAGCC 


CCTCCCAGAG 


CCTCAAGGCA 


TTGCTCCCTC 


TCTCTCTCTC 


38820 




TTTTTGTTTT 


TTTGGAGACG 


GAGTCTCACT 


CTGTCTCCCA 


GGCTGGAGTG 


CAGTGGTACA 


38880 




ATCTCGGCTC 


ACGGCAAGCT 


CCGCCTCCCG 


GATTCACGCC 


ATTCTCCTGC 


CTCAGCCTCC 


38940 




CGAATAGCTG 


GGACTACAGG 


CGCCCGCCAC 


CACGCCCAGC 


TAATTTTTTG 


TATTTTTAGT 


39000 




AGAGACGGGG 


TTTCACTGTG 


TTAGCCAGGA 


TGGTCTCGAT 


CTCCTGACCT 


TGTGATCCGC 


39060 


30 


CCGTCTCAGC 


CTCCCAAAGT 


GCTGGGATTA 


CAGGTGTGAG 


CCACCGCGCC 


TGGACTTTTT 


39120 




TTTTTTTTTA AGACGGGGTC 


TCACTCTGTC 


ACCCAGGCTG 


GAGTGCAGTG 


GCGCGATGTC 


39180 




GGCTCACTGC 


AACCTCTGCC 


TCCCCAGTTC 


AAGTGATTCT 


CCTGCCTCAG 


CCTCCCAAGT 


39240 




AGCTAGAATT 


ACAGGCACAT 


GCCACCATGC 


CCAGCTAATT 


TTCTGTATTT 


TTAGTAGAGA 


39300 




TGAGGTTTCA CCATGTTGGC 


CAGGCTGGTC 


TTGAACTCCT 


GACCTCCGGT 


GATCTGCCCA 


39360 
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CCTCAGCCTC CCAAAGTGCT 
TGCTGGGATT ACAGGCGTGA 
TTCTCACCGC CCCCTCCCCA 
TTCTCGCAGT GAGCTGGGCT 
5 ACCGAGGAAA TGCGGATTGG 
■ AGGTTGGCCT CAAGCAGCAG 
ACCCCTCCCG CCGTCCAGGT 
CTCCCGCCGT CCAGGTGGAT 
CGTCGCCCAG GTGGATGGTG 

10 CACCCGGGTG GATGGTGCCC 
GCCCGCTCCC AGGGTCACAC 
CGCTCCCAGG GTCACACCTC 
CCCAGGGTCA CACCTCACGC 
GGGTCACACC TCACACCCAC 

15 TTCTCCTTCT CTTTCGTCTT 
TAGAAAGAGT TTCTACTCTT 
TGCAACCTCC CTCTCCCCGG 
ATCACAGGAG TGTGTCACCA 
TCACCATGTT GGCCAGGCTA 

20 CTCTCAAAGT GCTGGGATTA 
GGAATGAAGG CTCAGAGAGG 
AGCCCAGTCA TGAGCCTGGG 
GATCTGTCCA GAACTGAGAG 
GGAGGCTGCT GCCGGCTCCC 

25 TCATGGTGCC AGGAGCACAG 
GGAAACTCAG AAATGTGGCT 
CTTAGCAGGC AAGCGGCCAA 
GGAGAGGTGC CAGCACCGTC 
GGCAGCTCGG ACAACACCAC 

30 TGTGAGCAGA GGCCACAAAA 
CTCCCAGCTC AGCCCCAGGA 
GGAAGGGCAG GAGGCGGCCC 
CCCTCCGCTG AGAGCCCCCC 
GGAGGAAGCG CAGATGGACA 



GGGATGACAG GCGTGAGCCC 
GCCCCCGCGC CCGGCCCCTC 
CCATCCTTCC CCTTCACTGA 
TGTTTTGTCT CCCTGCTTCT 
CCTTTGGATG ATTCATGAGC 
GGTATAGTAG TGCCCGCTCC 
GGATGGTGCC CACTCCCAGG 
GGTGCCGACT CCCAGGGTCA 
CCCACTCCCA GGGTCACACC 
GCTCCCAGGG TCACACCTGA 
CTCACGCCCA CCCCTCCCGC 
ACGCCCACCC CTCCCGCCGT 
CCACCCCTCC CGCCGCCCAG 
CCCTCCCGCC CACCCGGGTG 
CTTCGTCTTC CTCCTCTTCT 
GCTGCCCAGG CTGGAGTGCA 
GTCAAGCAAT TATCCTGCCT 
CACCTGGCTA ATTTTGTACT 
GTCTCGAACT CTTGACCTCA 
CAGGCATGAG CCACCCTATC 
GCAAGCATTC CACCCAGCAT 
CCTGACTGCA GGCTCCTGTT 
GCCAGGGGAC CACAGTGGCC 
CCCGGGGGCA GATGGAGGTC 
CATGGCAGGA GCCACCAGCC 
ACCACAGTCA GGCTGCTTGA 
GCACCTTTCC TGAAATATTG 
ATCTCTACCC AGATAAGGAG 
ACAGCTGGAG GAGGTCAGAC 
GAGCGAGGAG CCAGTGCCCA 
ACCGAGCCCA TGGGGAGGGA 
TGGAGAGGAC GGCGCTGCCC 
CACCCCCAGT ATCCCCGGGG 
GGACTCCCAG ATAGGGTGGG 



CCGCGCCTGG 


CCCCCCGCAG 


39420 


CCTCTCTTTG 


ACTCCCTTCT 


39480 


CTTCAGGGAG 


TTAAAAACAA 


39540 


CTTTGTACTA 


AACATTAGAT 


39600 


AGGAGTCAGA 


AAAAGGCACC 


39660 


CAGGGTCACA 


CCTCACGCCC 


39720 


GTCACACCTC 


ACGCCCACCC 


39780 


CACCTCACGC 


CCACCCCTCC 


39840 


TCACGCCCGC - 


CCCTCCCACC 


39900 


CGCCCACCCG 


GGTGGATGGT 


39960 


CCGCCCGGGT 


GGATGGTGCC 


40020 


CCAGGTGGAT 


GGTGCCCACT 


40080 


GTGGATGGTG 


CCCACTCCCA 


40140 


GATGCCCTTA 


TCAGCTCTCC 


40200 


TTCTTCTTTT 


TTTTTTTTTT 


40260 


ATGGCACAAT 


CTCAGCTCAC 


40320 


CAGTCTCCCA 


GATTGCTGGG 


40380 


TTTAGCAGAG 


AGGGGGGATT 


40440 


GTTTATCeAC 


CGGCCTCAGC 


40500 


TGCCTCACTT 


CTACAGAGGA 


40560 


CACACAGAGT 


GCCGGGTGAG 


40620 


GGGAGCTCGC 


GGAGGTGGGG 


40680 


TCTGACCCCT 


GGAGGGCCCT 


40740 


ACTGTCACCC 


AGGCTGCTTC 


40800 


GATTTGCCTT 


TCCCTGGGCA 


40860 


CGTGCGGTGA 


GCACTCATCT 


40920 


AGGCCTCAGA 


ACAAGCCCCA 


40980 


ACCCAGGTCC 


TGAGAGGTTA 


41040 


TCTGGGTTGC 


AGAAGGAGAA 


41100 


GATGCCGAGA 


TGCCCTCGCC 


41160 


CCGTCAGGGA 


AAGGCTGTCA 


41220 


TCAGGGGCAG 


GAGGGGAGTC 


41280 


GTGTCCAGGA 


GGAGGCGGAG 


41340 


GAGGTGTGGC 


CGGTGACACA 


41400 
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CACGGTCCCC TCCTGGCAGG TGCTGAAGTC 
GGGGCGGGGT CAGGTCGGGC ACGCGTGGGT 
CCCGAGTTGA ACCAGTCAGC TCGGGAGAGG 
AAAAGGTGGG AGGGAGTGCG GCTCGTGAAC 
5 TTCGTCCTGT CCTCCCAAAC GTCGAGTGAA 
GAGCAGAGCG TTTCGGGAAG GGCGGGCCCA 
TCCCCTGCGT CCAGAATCCG CCCCCCGCCC 
CGCGCGGGAC GGGCCGGGAG GCCGGGGTGG 
CGCGGGAGGT GGAGGGGCGG GAGGGGCGGA 

10 GCAGTCGCCC CTCCCCCATC CAGACCCCGC 
AGCAGGTGGC GCGGGCGTCG CAAAGGGTGG 
GCGCGGTCCC CCTCACTCCG GGCTCCGCCG 
CCTCCAGCCC CTGTGCTGCA CTGGCGCGGG 
AGAGGGTCCC ACCCTCTCCC CGCCTCCCCA 

15 CGCCGGACGC CCCAGGGACC CCGCCCGCAC 
CACGCCCGGG TCTGGGAGCC ACCCTGCGGC 
CAGGGGCGAG GCGCCCCCGC CCGACGTCCC 
GCAGCCCAGC GCCCCACCAG CCCCGCCGGC 
CCCAGGCTTG CAACGCGCAG GGTAGGAGAA 

20 CCCAGGCGAG GCCAGACGAG GCCTCTGCTC 
CCGGAGCCTA CCGGAAATGG TGCTGGCCAT 
TGGAGAGGCG CAAGTGGCGG CCGGAGCTGC 
GAGGGGAGAG GAGTGGAGGG AGCGAGGGCG 
AGGGAGGGAG ACAGAGGGAG . AGAGAGAGAG 

25 GGGAGGGTTG GGGGAAGGAG AGAGAGAGAG 
AGGGAGGAAG GGAGGGAGGA AGAGAGAGAG 
GGCTCTCAGG GGGAGAGGGC GGAGGAGGGG 
CCTCCAACCG CTCACAGCGG GACAGCTGCT 
TGGGGTGGGG GTGGAGATGA GCCCCCATTC 

30 GAGGAGCAGG TCCCACCCAT GGGGTGGGGC 
AGGCTGGGCT GGCGCCAACA GGGTCCAGGT 
CCACAGGGGT GGGCGCTGGC CTGGAGGCCT 
TGGGCAGGAT CTGAGCGGAG AAGAGTGAAA 
GCCGTGAAAG AGGTGGTTTC CCGCCTGGGC 
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ACCTGGAGCC 


TCCAAGCCCG 


TGGGGCCTGA 


41460 


GGGCGGAGTT 


CTGCGCCCCG 


GGCCAAGGCG 


41520 


GACCGCGGCG 


ACCTGTCCCG 


GGGGCGTAAG 


41580 


GGGGGCGGCG 


ATGGGAAGGA 


GGTGCGGCCC 


41640 


AAACGAAGCG 


GGTTCTGCGG 


CCTCGCGGCG 


41700 


GCGTCCTCGC 


GCCCGAGGTC 


GCCCGGCAGC 


41760 


GGGCCTGCGC 


CCGCCCCTCC 


GCCTGAGCTC 


41820 


GCGCTACCTT 


CGAAGGCGGT 


GGGTCCGCCC 


41880 


GCCCTCTGGT 


CTCCGGAGGG 


TTTGGGGATC 


41940 


GGCGCAAAGG 


GCAGTGGCTT 


TTCTGGCCAG 


42000 


TCCCCGAGGC 


CGCAGCGGTG 


TGGGGGGAGG 


42060 


TGTCTGGCCC 


GCCCCCCTCC 


TTCAGCGCCC 


42120 


GAGCGCCGGG 


TTCCCGGCTG 


GGGCTTTGGC 


42180 


CGAAGGCTCT 


GGCGGACCCA 


GATCTCGGGT 


42240 


ATCGCGAGCG 


CGCCCACCCG 


GTCGCGAGCC 


42300 


AGTCGCGCCC 


TGCGTGGCAC 


GCTGCTCCCC 


42360 


GGTCCCGAGC 


GCTCCCCGGC 


GCGGCGCCTC 


42420 


GCCGCAGACC 


CCAGCCTCGG 


GCGGGTCGGG 


42480 


GGGAAATTGG 


CGTCCGCTGC 


CGGCCGCTGC 


42540 


AGATCCCGCC 


GCCCCACAAA 


GCCCGTGGCC 


42600 


GGTGCTGGCG 


GGGGTTGGGC 


CTGCGGAGGC 


42660 


AGACGGCTGG 


TGCTGCAGTG 


CCGGGGAGGG 


42720 


GGCGGGAGGC 


GGGCGCGGCG 


GGAGAGAGAG 


42780 


GGTTGGGGGA 


AGGAGCGGGG 


GGAGGAGGGA 


42840 


AGAGAGACTG 


CGGGGGCGGG 


GGAAGGAGGG 


42900 


GAGCAAGCGC 


CTGGCTGCGG 


AAGGGGCCGC 


42960 


GGCTACCCGA 


ACTGCAACAA 


GACCCCCCAC 


43020 


TCTCCAACTT 


GGCTTTGTGA 


GGCCTGAGAG 


43080 


CCCAGGGCAG 


GCGGGGCAGG 


GGCAATGCCG 


43140 


CGCAGAGCTC 


TTCGCCGCCA 


AGGCCGCTGT 


43200 


CTGTGCCTGC 


CATCGGAGAG 


GATGCCACAG 


43260 


CCAAGGGGCA 


TCTCCTGTGA 


GCCCAGGGGA 


43320 


GTGGAGGAGT 


GAGGCCAGAA 


CAAAGGCTTT 


43380 


TCAGACCTTC 


ACTCACTGTG 


TGGCCCAGGC 


43440 
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CAAGGGCAAG 


CGTCTGACCT 


CGCTGGGCCT 


TTGTTTCTCA GGGGTAAGAT 


GAAACAATGA 


43500 




TGCCCCCAGA 


CGATGGAGAG 


GAGGGGTGCC 


AGGGTTGTGC 


GCACTTAGTG 


AGTGGGGGGC 


43560 




AACCTATCCT 


GCCTCCCCCT 


CTCCTCATAA 


CTCCCAAAGG 


GAAAGCCTGG 


TAGGCAAACG 


43620 




, GAGCGTCTTT 


GCCATTGCAG 


GGATGAAGCC 


ACCGAGGCAG 


GGAGAAAAGT 


GCTTTGCCCT 


43680 


5 


ACAAGCAACT 


AAGTCATAGG 


GCCAGGAGCA 


AAACCCTGAA 


AACCTCAGGA 


GACTTGCAGA 


4374.0 




GCCATGAGGC 


TGGCTCAGCA 


ACACAAAAGC 


CAGGGGCAAG 


CCTCAGCTCT 


AGCAGTGCGG 


43800 




TGGGAGCACC 


CAAGGCCAGT 


CACATCCTAG 


GGTGGCCTGG 


AGAGTCCTGA 


CCCCTGACGT 


43860 




GCAAGCCGGC 


ATCATCCCCG 


GGACTGTGAG 


TCTGGTGGGG 


GTGATGCCCA 


GGAATGTGAC 


43920 




ATTGTGTGGC 


CCAGAGGTAC 


CCTTAAGACT 


GGAGGATCAC 


CAGGCGGGCC 


CTGACCTCAT 


43980 


10 


CACAGGAGCC 


CTTTAAAAGC 


AiGTTTCCTTT 


GCCTGGTTGA 


AGAAATCGGA 


GGGATCAAAC 


44040 




CAAAGAAGGT 


TTTCTGTTGT 


TGAGATGAGG 


GGGCCACGTG 


GCAAGGATCT 


GAGAACTGCT 


44100 




CCCAGCGAAC 


AGCCAGCAAG 


ACAACAAGAC 


CTTAACTGCA 


AGGAAGTGAG 


TTCTGCCAAC 


44160 




. AAGAAGAGAA 


TGGGCTTGGA 


GGCAGGTTTG 


ACCCCAGGGC 


CTGCACACAA 


GAACTGAGCC 


44220 




CAACTGCCCA 


CTTGGTTTCA 


GCCTTGGGTT 


ACTAAGAATT 


AGGAGGTAAT 


GAATGAGAGT 


44280 


15 


TGTTTTAAGC 


TGTTGGTTTT 


GTGGTGATTT 


GCTATGAAGC 


CATATCAAAC 


TAATATACAC 


44340 




ACAGAGGTGT 


TGGCCCCTGG 


GCCATTCCTA 


GGAAGCCAGC 


TCTGCGAAGG 


AGGAAGAAGG 


44400 




GCAGAGAGGC 


ACACAGAGCT 


GCCCACCACA 


GCAGCTGTGT 


CCTCCCTGTT 


GGCCACCACA 


44460 




GTAGCAGTTG 


GGGATGGTCA 


GCATCCTTCA 


GGCAGACTCC 


AGCCCCGGGT 


GCTGGAGCTC 


44520 




AGGTGCTAGG 


GATCAAGAGA 


AGTAGCCCTC 


TCTGGGACCT 


CCAGAGTCTT 


CTCATGTGGG 


44580 


20 


TGGGGTAGGA 


CCCACCCAGT 


CAGGCTCAGA 


GCACCGCAAT 


GCCTCACACT 


CATTGTGACT 


44640 




■ CTGGCCAGGC 


CCTCTCTGAG 


CCTCTGTGTC 


CTCATCTGGA 


GCACAGGGAC 


CAGGTGTGTG 


44700 




GAAGCCCGTG 


GCATAGTGCC 


AGGAACACAG 


TAGATGTGCA CAGTGTGCAC 


TAGCAGGAAC 


44760 




ACACAACAGG 


GGTACTGACT 


GTCAGCACCT 


AGGCAGGCAC 


ACGCAATGGG 


GTACTGACTG 


44820 




TCAGCCATAC 


TGACTGTCAG 


CGTGCTAGCA 


GGCATACACA 


ACAGCTGTAC 


TGACAGCACA 


44880 


25 


CTAGCAGGCA 


CATGCCATAG 


GTGTACTGAC 


TCTCAGTGCA CTGGCAGGCA 


CACGCAATAG 


44940 




GAGTAATGAC 


AGCATGCTGG 


CAGGCACACA 


ATAGCTGTAC 


TGACTGTTTG 


CCCCAATATA 


45000 




GTGCCAGGTC 


TTGGAGCAGA 


TTTTGACTTC 


TCACCAAGAT 


CAAATGCAGA 


AAGTGCACGA 


45060 




GCATTTCAAA 


GATGTTTTTC 


ACATGCACAT 


TAGTGCTAGT 


TAAAAAAATG 


TTTTGACTGG 


45120 




GTGCAGTGGC 


TCACAACTGT 


AATCCCAACA 


CTTTGGGGGG 


CCGAGGTGGG 


CAGATCACCT 


45180 


30 


GAGGTCAGGA 


GTTTGAGACC 


AGCCTGGCCA 


ACATGGTGAA- ACCCCATCTA 


CCCTAAAAAT 


45240 




ACAAAAATTA 


GCCAGGTGTG 


GTGGCAGGTG 


CCTGTAATCT 


CAGCTACTTT 


GGAGGCTGAA 


45300 




GCAGGAGAAT 


CACTTGAATC 


CAGGAGGCAG 


AGGTTGCAGT 


GAGCCGAGAT 


CCCACCACTG 


45360 




CACTCCAGCC 


TGGGCAACAA 


TATCAAGACT 


CCACCTCAAA AAAAAAAATG 


TTTTTCATAA 


45420 




AGTGTGACTT 


TTATCAGACC 


TCTGCATTCT 


TGAAATTAAC 


TCTGGCTTGG 


CTGGGCGTGG 


45480 
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TGGCCCACAC CTGTAATCTT 
GAGTTCAAGA CCAGCCTGAC 
TAGCCGGGCG TGGTGGCATG 
ATCGCTTGAA CCCAGGAGGT 
5 CCTGGGCGAC AGAGCAAGAC 
GGCTCCTAGA AGGAGCCCTA 
AAGCACCTGC TGTATGCTGG 
TACAGGAGTT CCTGACCACC 
TGACTCAGAC TTCAGCTCAG 

10 GAGATGTGTG GCTGCAAAAG 
CAACGCTCTG TGCCTCACTT 
CTGGGCGCGG TGGCTCACGC 
ACCTGAGGTC AGGAGTTCAA 
AAATATAAAA AATTAGCCGG 

15 CTGAGGCAGG AGAATCACTT 
CACTGCACTC CAGCCTGGGC 
CAGAAGGTGA AGTCTATGTT 
GAGAGAGAAG GGGAAACTGC 
AACAGCCTGG AGGAAACTGA 

20 CACGTGGGGA TTACAATTCG 
GCATGTATGG GGGGCACTGA 
GCTGGTCTCC TCATCTTCAA 
- ACCACAACAG GCACATGGGG 
TCACACCCAG GAGCGTGGGC 

25 CAGCCATGAA GAAGCCACGA 
AACTCCCTCC TGGGGCCTTC 
AAACATCCGT GACGCTGGGC 
CCCTGCTGCC AGCCAGTGCC 
GGACAGTGAG CCGAGCAGAA 

30 TCTCAGGAGC CGGGTCCTGG 
GCTCACACTC GAGCAGTAAA 
AATGGTGGTG GCTCGGCCTA 
CATCAAGGGA GGAGACGGGC 
TCACTGGCCA GCGGAGCCGG 



AACACTTTGG GAGGCTGAGG 
CAACATGATG AAACCCCATC 
CACCTGTAAT CCCAGCTACT 
GGAGGTTGCA GGGAGCCGAG 
TCTGTCTCAA AAAAAAAAAA 
TATCTCAGCA GGACAGTGAG 
AGCTGTGGGT ACGTCAGCAA 
CCAGGCCAGC ACGCTCCTAT 
TCCACAGGAC AGCCTTTTCT 
GTAAACTCCT GGCTCCTGAG 
TCTCACCCAG AAAGTGTGGA 
CTGTAATCCC AGCACTTTGG 
GACCTGCCTG GCCAACACGG 
GCTTAGTGGT GCACACCTGT 
GAACCCAGGA GGTGGAGGTT 
AAAAAGCCAA AACTCTGTCT 
AGTCCCAGTT CCCAGGTCGT 
CACTTTTAAA CCATCGGGTC 
CCGCATGATC CAACCACCTC 
AGGTGAGACT TGGGTGGAGA 
AACTTGTGCT TGGTGCCCAT 
CTCCCTGCCG AGTCTCAGAT 
CTGGGTGCCA GTGAGTGCTG 
ACGCATGGCA TTCGCATGTG 
AGAAGTGCTG CCTGCCGGCC 
TGGGGCCTTC TGGGGCATGG 
TGACCCCACA GAACGGTGCG 
GGGTGCTGGG GACTCAGGGA 
GCTGGAACGC AGGAGAGGAA 
GCAAGGCGCA GCCGTTTTCA 
AAGATGCCTC TGGGGAGGAG 
GAGAGGCGGT AGTGGAACGC 
GGGACCCCAG ATTTCTGCCT 
ACACAGAACG CGCAAAACGC 



TGGGCAGATC 


ACGAGGTCAG 


45540 


TCTACTAAAA 


ATACAAAAAT 


45600 


CAGGAGGCTG 


AGGCAGGAGA 


45660 


ATCGCACCAC 


TCTATTCCAG 


45720 


GAAAGAAAGA 


AATTAACTCT 


45780 


TCATTCAACA 


GACATCTGTC 


45840 


TTAGAGGAAG 


AGGGCAGGGG 


45900 


AGCAGCTGGC 


AAGGAGCAGA 


45960 


GGCCACTGCT 


CTCAGGAGAT 


46020 


CAGGGTCTGG 


GCAATCTGCT 


46080 


CAATGAGAGG 


ACTTATCTGG 


46140 


GAGGCCGAGG 


CGGGTGGATC 


46200 


TCAAACTCCA 


TCTCTACTAA 


46260 


AATCCCAGCT 


ACTTGAGAGG 


46320 


GCAGTGAGCC 


AAGATTGTGC 


46380 


CAAAGAAAAA 


AGAATCATGG 


46440 


ACATGGCGGC 


AGGAGAAAGA 


46500 


TCCTGAGCAC 


TCACTGTCAG 


46560 


CCTCCAGGTC 


CCTCCCTCCA 


46620 


CACAGAGCCG 


AACCATATCA 


46680 


TCATTCAACG 


AGTGTGTGTG 


46740 


AGGCAGCCTG 


CAGTTCCTTC 


46800 


GGGCTTCTCC 


GAGCACTATC 


46860 


CCGTCAGTGG 


ACATTAAACA 


46920 


GTGCGCGGTC 


ACGCAGCGCC 


46980 


GAGCTGGGGC 


CGCCTGAGAC 


47040 


GGCCTCGCTC 


TTGGAGTCAG 


47100 


GGCCCGCCGG 


GACCACTGCG 


47160 


GGAGAGGGGG 


CGGTCAGGGC 


47220 


AATTTTCAGG 


AAAGCGGTCG 


47280 


GCCCGTGCAG 


CTCTCCGGGC 


47340 


AGACCCTGGT 


GGGGGAATGA 


47400 


GTGGGCGATG 


GAAGTGAGGT 


47460 


CGTGTAGGCC 


TGGAGGAGCC 


47520 
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GAAGAGCAGG CGGACCCCCT CCGCGGGGGA ACAGTTTCCG CCGGGAGCAC AAAGCAACGG 47580 

ACCGGAAGTG GGGGGCGGAA GTGCAGTGGG CTCAGCGCCG ACTGCGCGCC TCTGCCCGCG 47640 

AAAACTCTGA GCTGGCTGAC AGCTGGGGAC GGGTGGCGGC CCTCGACTGG AGTCGGTTGA 47700 

GTTCCTGAGG GACCCCGGTT CTGGAAGGTT CGCCGCGGAG ACAAGTGAGC AGTGAGTCGC 47760 

5 AGTGACCCTA CAAGTGGTTC TTTTACCCGA GCGGCTCGTA GGCGCGTTGC GGTTTTTCGA 47820 

AACTACAGCT CCCGGCAGGC CCCAAGCCGC CCTCGGGGCC GCGGGTCGGC GGATTGGCCG 47880 

CGCTGCATTT TGGGACCTGT AGTTTCCTGC GCTCGTGGCG CTGGCGCCGC GGCGTTGGCT 47940 

GAGCCCTTGA CCGGGGCTGG AGGGAAGGGC CGACATTCAG TGTGTCCGCG TCTGTTCTGT 48000 

TAGTCCCAGT TCCCGGGCGC GATTGAGGCT TAGAGAAGTT GAGTGATTTG CTGAGGGCTG 48060 

10 CACGGGTTGG CATCCCGGCA TGCTCTTTCG CTACTTTGGC TGCATCTGGT TGCCCACCCG 48120 

GGCGGATGGG GAATGGACTC CAGCCAGCCA GGAGGGCAGA GGGCTGGAGA GGCAGGGCCG 48180 

GAGGTTCAGA CCCTCCGCTC TGACGTTGCG CCTGGTGAGG CCGGGAGGGG TGCCGCTTGC 48240 

CTCTTCAGCC CTCACGCTCT TGTGGAAGTC GCGGAATTAC TGCAGGCGGA ACTTGCAGCA 48300 

CTGTGGGCGT CTTTTCCAGA GAAGGACGGA GTTGTGGGGC -GGGAGGATAA GGCAAGGCCC 48360 

15 AGCCACTTCG CATCTTCGCC CCGCCAGCTC CTCGAGATGG GATATACCAG GGTTGCTCTC 48420 

CAACCCTCTC CGCAGGAGGG ACTGATGGAA ACGCCTGGGA AAGTAGCCCG GTACCCACAA 48480 

AGGCTGTCTA CAAACAGAGT CTTACTGTCT TTCCCAGGTC TGTGCCATAG GGATTCTCGA 48540 

AGAGAACAGC GTTGTGTCCC AGTGCACATG CTCGCATCGC TTACCAGGAG TGCCCGAGAC 48600 

CCTAAGATGT TCGGAGTGGT TTTTTCGCAC AGACCCGAAT AGCCTGCCCC TCAGCCACGC 48660 

20 TCTGTGCCCT TCTGAGAACA GGCTGATATG CCCAAGATAG TCCTGAATGG TGTGACCGTA 48720 

GACTTCCCTT TCCAGCCCTA CAAATGCCAA CAGGAGTACA TGACCAAGGT CCTGGAATGT 48780 

CTGCAGCAGG TAGAGCACAG GCCCCGAGGA AAGGACTGCG GGTGGGTGGA GCTTCAGCCA 48840 

GGACGGGGTG TGCTTCCCTC TCCCGGCCCA TTCCAGCCAG GCCCCTCCGG GCCAGAGGCA 48900 

GCGTCTGTCA TAAAAAGGGC TGGTGTTCCA GGTGGGGTCA GAGAGAGGAT TGACAAGTAA 48960 

25 AAACGATCGT CCTTTGAAGG GGGCCGGCCC CTCCACACCT GTGGGTATTT CTCATCAGGC, 49020 

GGGACGAGAG ACTGAGAAAA TGAATAAGAC ACAGAGACAA AGTATAGAGA GAAAAGTGGG 49080 

CCCAGGGGAC CGGCGCTCAG CATACAGAGG ACCTGCACCG GCACCAGTCT CTGAGTTTCC 49140 

TCAGTATTCA TTAATTACTA TTTTCACTAT CTCAGCAAGA GGAATGCGGC AGGACAGCAA 49200 

GGTGATAGTG GGGAGAAGGT CAGCAAGAAA ACGTGAGCAA AGGAATCTGG GTCACAAATA 49260 

30 AGTTCAAGGG AAGGTACTAT GCCTGGATGT GCACGTAGGC TAGTTTTATG CTTTTCTCCA 49320 

CCCAAACATC TCGGTGGAGT AAAGAGTAAC AGAGCAGCAT TGCTGCCAAT ATGTCTCGCC 49380 

TCCTGCCACA GGGCGGCTTT TCTCCTATCT CAGAATTGAA CAAATGTACA ATCGGGTTTT 49440 

ATACCGAAAC ATTCAGTTCC CAGGGGCAGG CAGGAGACAG TGGCCTTCCT CTATCTCGAC 49500 

TGCAAGAGGC TTTCCTCTTT TACTAATCCT CAGCACAGAC CCTTCACGGG TGTTGGGCTG 49560 
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GGGGACTGTC AGGTCTTTCC 
AACCTTGGGC AATACCCGGC 
GTGCCCCTGG TTTATCGAGA 
AAACATATTG TTAACAAGGC 
5 TACAACACAT GTTTCTGTGA 
GGGCAAAGCA ATTGTTCAGG 
CATAGACACA GTAACAGTCT 
AGGAGTGCCT TGGGGGCCTC 
TTCCATTAGA GGGGCAGCCG 

10 GCAGGGAGGT GGCAGTGGTG 
ACCCGGCTCA CACCCTTCCC 
TATGTAGGAC GTCAGATAGG 
CTATTTTGAG ACTTGACACC 
TGTTTTGGGT AACGTTCTGC 

15 TAGGGCAAGA GTGGGACTTG 
TGGCAGTGGC CTCTGCATCT 
CAGCTTGTGT GGCCTCGCCT 
AGAAGGTGAA TGGCATCCTG 
GCACCACGCT GGCCTGGCGA 

20 AGAGGGCGCA AGGAGAGCTT 
CTGCTGCTGG AGACCCCATA 
GATGGTTGGC AAGGGATGGC 
TCAAGCAGAA CTCAAGGAGA 
AAACCTAGGG TTGGGCTTTT 

25 TTTGATAAAC CAGTATCTGG 
AATGCCACCT GCCTTTGATA 
TTGTGTCTGG TCAGCTTGCT 
CTCGCAACTC ACACAGGTCA 
AGTTTACACC TGTCTCGGGG 

30 CCCCATGCTG TGCTGTGGTG 
GAGCGCTGGT GCCCAGGGGT 
AGGAAGCAGC TTTGCATCTG 
GCTGTTGTAG CAGCTGTGCA 
GTAGGAGGAG GCCCTGCTGC 



CATCCCACGA GGCCATATTT 
TTTCCAGGGC AGAGGTCCCT 
CTGGAGAATG GCGATGACTT 
ATGTTCTGCA CAGCTCTAGA 
GCTCAAGGCT GGGGCAAAGT 
GTACAGGTCA AAATGGAGTG 
GATCTCTCTT TTCCCTACAG 
TCTGAGGAGC AGCTGATATT 
GTGGCCAGCC TCACTTGGAA 
CAGGTTGCCT TCTCCTGGGT 
CTCCCCGAGT TTCCTGCTCA 
ACAGCATAAA TGTTTGGATC 
TAATTAGTCA TCTTACTATT 
AAATCGTTTG CTAATGGCGG 
CCTGTGGACT TCTCCGCGGT 
GCAAAGAGCT GCCCGCTGGC 
CTTCCTGGCT TCCCCGTAAC 
GAGAGCCCTA CGGGTACAGG 
GAACACCTCC GAGACGGCAT 
TTCCCGGATC GGGCCTTGTC 
GGTGACCCTA GTTCCCAGGC 
GCTGAGGGTG GGGTGGGCCC 
ATTTTTTAGC TGCTGTATAA 
TTGCTGAATT AGGGCACGGC 
GGTGTCAGAT TCTTGGCTGT 
CGTGAGAACG TTGTCTGAGA 
ACACGGACAT CCCAAAGATT 
TCAACGAGCT TCGGAACACC 
TCCTCAAGAG AACCAGCTTG 
GAGGGTGGTG GTCTTTCTAG 
GGGGTGCGGC CTGGGCTGCC 
TGTGCTGACC TTGGGCGGGC 
GTAGGTCAGG GCTGGCCCCC 
TTGTCAGAGC CCAGCAGAGT 



CAGACTATCA 


CATGGAGAGA 


49620 


GCGGCTTTCC 


GCAGTGCATC 


49680 


TTACCAAGCA 


TACTGCCTGT 


49740 


TCCCTTAAAC 


CTTGATTCCA 


49800 


TACAGATTAA 


CAGCATCTTA 


49860 


TGTTATGTCT 


TCCCTTTCTA 


49920 


TCCTTGAGGG 


TGACAGACTT 


49980 


CACGGGTCAG 


GAGGAAGCAT 


. 50040 


GGTCTTTGAA 


CCTCGGGGGT 


50100 


TCCTTGAGGT 


GCCCTCTTGT 


50160 


GGTTCCCGTC 


TGAGAGCTTG 


50220 


CAGAAACGCA 


GAACAGTTTC 


50280 


TAAGCTGAAA 


AATAGTGTCG 


50340 


CTGAGTTGCT 


TCACGCCCTT 


50400 


CCCACAGGGC 


TCTCGCCACC 


50460 


TGCCGAAGCT 


TGTCTCAGGG 


50520 


CCTTGCTCCG 


AACTCCGTTC 


50580 


GAAGACGCTG 


TGCCTGCTGT 


50640 


CTCTGCCCGC 


AAGATTGCCG 


50700 


ATCCTGGGGC 


AACGCTGCTG 


50760 


CTCTCCTGGC 


CTCCTGTGGG 


50820 


ATGGGGACTC 


CTGCCGTCTC 


50880 


TTTCTCGCCA 


TCGTGGGTGT 


50940 


AGATGCCCAC 


TTCACCCATT 


51000 


CTGCAGGGCC 


GAGTTAGCCG 


51060 


ACCGTGACTT 


CTGTGCTTGC 


51120 


ATTTACGCCT 


CCAGGACCCA 


51180 


TCCTACCGGT 


GGGTCAGACG 


51240 


GCATGGTGCT 


GAGTCCACAG 


51300 


ACGCTCCCCC 


GAAGTGTGCA 


51360 


TCCAATGCCC 


ATTACTTGTG 


51420 


GTCCTGAGCT 


CCTCGCAGGT 


51480 


AGTGCAGCTT 


TGCACATGAA 


51540 


CTTGGTGTTC 


TGTCGGGTTC 


51600 
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CTGTGGCCGG ACCAGTGGCA GGGTGCTGTG 
TACCCCCGCT CGTCTTCTAG CTCCCTCCTA 
TCCTCTGACC GCCGAGGCTC CTGCGTGTCT 
TGGGTGTTGG AGACAGCTCC TCCTCGACCC 
5 TGTGTGCTGG GCTCCCGGGA GCAGCTGTGC 
AACCATCTAC AGGTAGGCTC CTGGGCTCCC 
TGCTGGGTGT CCAGAGCCCC AGGCTGCGCT 
GGGACTGCAG GGGAGGACCT GGTGGGGGTG 
CTTCAGCTGC GCACTCTGCC CTTCCTCCCA 
10 AGTCGCTCCT GTCATTTCTA CAACAACGTA. 
TCGGGTTGGA GTGTGTGCAG CCTCTCAGGG 
GCTTGCCCGG TGGGGCGGCC AGTGCGGCCA 
CCACCCATGT TAGACTTCTG TGTGGAAGAG 
CAAGACTGCC TCTGGCTGGT GCCTGGGGCC 
15 TGATGAGAGG GTCTCCCTGC ACCTAACCTG 
ACCGGCTTTC TCTTCGGGGA CTTTGTTGCT 
CCTCATACCA TCTTGGGAGG GTGTCCTGGA 
CGTGTGTCTT TTTCCTGTGC TGACCTTGCT 
AGGCTGAGTC CTGTGGGTCC AACACGGGAT 
20 GAGTGACAGG AGCCGGCCGG GGCAGCTTGC 
GGGCCGGGGG TGCtTCTCCA CCCACGTGGC 
GCTCATCACG GGGTCCTAGA GACAGTGGCA 
TTCTGTGACC TGAGGGTGGC ATCTGTGCAG 
GGTTCCCTCT GTTTCCTGCC TCAGTTGGGG 
25 ACTCAGAAGG CATCGGCGGT CCTGTGGGCT 
AGTGTGCCAG GACTGAGGAC CCTGAAGCTG 
AGAGACTGAG CCAGGTGGTC CCGCATGACC 
TGGAGGGACA GGGTGGGCGG AACATGGGCC 
GTGCTGTGTT GCTGGAGACA TCCTCTGTGT 
30 GTGGCACCAA GGACCCTCAG TCATCTTGAT 
TAAGAAGGGG CTCTGCCTTT GTGCCCCCAG 
GGTGATGCCT GCAGGACGTG ACTCTGGGAC 
TCTCAAGCAG GCGTCATTTC CGAGGTCCTC 
GGGTGTCCCC AGTCCCATGC AGCGGGGGTC 



GAAGCTGTCG 


AATCTCCTCC 


CTCTGTCCAG 


51660 


CGCCCGGGCC 


ACGTTTCAGT 


TATGCTCACT 


51720 


CCATACAGCT 


CACGCTGCAG 


GGCCACGCTG 


51780 


ACGGTGCTCT 


CTCCCACCAG 


GCCTAAGGTG 


51840 


ATCCATCCTG 


AGGTGAAGAA 


ACAAGAGAGT 


51900 


GCTCCGGCTC 


AGTGTCCGAC 


AGGCGAGTGC 


51960 


CCCGCTGGGC 


TAGGGTTTGA 


AGTTCACTGG 


52020 


GGGACTGGCT 


TCGGTCCTTT 


CTTGGCCGTG 


52080 


CAGATCCACT 


TGTGCCGTAA 


GAAGGTGGCA 


52140 


GAAGGTACAA 


GCAGCTGGGT 


GGGACCAGGG 


52200 


TGGAGCTCAG 


TGGTGTCACA 


GCCTGGTTGT 


52260 


TGTACCTGGG 


CCCTGTCTTC 


TGACTCGGGG 


52320 


CTCACACAGT 


GGTCTGAGAC 


AGCCAGCCGG 


52380 


TTGGATTTTG 


GGAAGGCTCC 


CTCCATTTCC 


52440 


CTGGTGCAAA 


CAGTAGGGGT 


TTTGCTGAAC 


52500 


TGCCCAGCAG 


CAGGTGCTCC 


AGTGACCGGC 


52560 


AGCCGTGTCT 


GGCCTCCCGC 


GACCCTGCCC 


52620 


GCGGAAAATT 


ATGGCCCTGA 


GTGTGACTCC 


52680 


GCCTTGGGGC 


CTCTTCTGGA 


GACGGGATGT 


52740 


CCTGTGACTG 


CACGTGGCCA 


CAGCCTGTGA 


52800 


TGCCCCTCGG 


GTATGTCAAG 


GGCTTCTGGG 


52860 


GGGTGCACCC 


CCGTTGGCTG 


CCCTTACAGT 


52920 


TCGGCGCGGT 


CTGTGCTTCT 


GTGGGATCAG 


52980 


CTCAAGCCTC 


AGGTGAGGTG 


GCCCCGGAGC 


53040 


GCTTTCTGCA 


CTCACGTTTG 


CTGAGTGCTC 


53100 


CTCTTGTATT 


TAGGGCGGCG 


CTCCCCTGGC 


53160 


CACTACCAGG 


CGTTTCTGGG 


CCCTGGCCCT 


53220 


TGCAGGGAGG 


CTCCCGCTTA 


CTGGAGGCAT 


53280 


TGCTTCTTGT 


TCGCTGTGGT 


TTTTGGTCTG 


53340 


GTGTGGTTGT 


CCAGGCCTTT 


TTGTTGGTCC 


53400 


GTTCCCTGAC 


AGGAGCTGCC 


GGCTCGTCCC 


53460 


GGGGGGTTGG 


GCAGATGTGC 


TGATGGAAAT 


53520 


ACCTGGATTT 


CGAGGACAGG 


AGTGCCTGCT 


53580 


CTTGGGATAG 


; CATGGAACGC 


TGAGCATGGG 


53640 
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CCTGGCCGGC CGTGGTCCTG GACAAGGGCA 
GGTGGGGACG CTGGGCCTGG TACCTGGTGG 
CCTCTGACTG CCTCCTGGTG CTGCTTCCGT 
CATCTGTCTC CTCCTCCAGG CCTGTGGACT 

5 CTGTCCCCCT TGGCTTGGGC TCTCACTGCC 
CCTTCGCACC CTGTCTGCCA CCGGACAGGC 
GCCTGAGCAT CTTCAGAGTG GGCTGCAGCT 
GTATTTTGGG CGAATGAGGA GACAGCTGGA 
TGAGGACTCT GGTTGGGGAG AGCAAGCTTG 

10 GCCCCGCCCC TCGTGCCCCT TCCCCTCGCA 
■CGCCAGGCAA GGTTTGGCAA GTGCCGCTGT 
TGCTATGGTG CTGCCCAGGG GTGTGCTTTT 
GTATCTGGGG TTGGGTCATC GCTGGTGTGT 
GTGCATATGT GTGCGCTTCT GGCCTCTGCA 

15 CCTCCTGGGG ACAGGCACAA AGCAGCCATG 
GCCCCACGTG GCTGCCCTGT GACTCTGGGG 
TGTGTTTAAG CAACTCCCTA AGGAATTCTT 
GAGGCAAGGG AGGGGCAGGA CATCCCCCAG 
GGCCTCTCCG CCAGTACAGG CAGTGTCCTT 

20 GGGCCACTGT TTAGCCCAGG AAAACCTGGC 
GTGGTTGGCA GGGAAGCTGA GTGACGGTGC 
GGAGATGCCT CTGTGGGTTG GGGCGATAGG 
GGGCTGAGTG TAGCCGCCAC TCCTGTCCTG 
ACCCTCGGGA TCCTGCTGGC CTCCGTCTGG 

25 AGAGGAGGAT tCCCTCAGGA TCACTCGGGT 
AGTGGGTGCT TTGTGGCGCG TTCACGGTTG 
AGGAGCCGTT CTGTCCTGGG CATCCCCCTG 
TGTGTTTACC CCTGCCTCAC' ACCTGCAGAA 

atcctggaca ttgaggactt ggtcaagagc 
30 aggccacgac cactgtcctt ccatggccca 
tatttcttca cttttctttg ttccttttta 
aaaagtttaa gcagctctga ggaagataga 

TCAGCCATCC CCGGTGGCCA CCGTGGAGTG 
CGGAAGCACG CGCAGTTTTG TCTGCACAGA 



gtgccccggt 


GGCTGCTGGG 


CCTGGGACCT 


53700 


GGATGCTGGG 


CCTGGGACCT 


GGTGGGGAGG 


53760 


CTGTGTTAGG 


CCTCTGGGTA 


TTGGGGCCCC 


53820 


CAGACCAGGA 


AGACACAGGC 


CAGCCCCTGC 


53880 


CGACCTGGCG 


GGAGGTTGCC 


TAGCCGTGAA 


53940 


TGTGAGGGGG 


TGTCTGCAGC 


ACCTGCACCG' 


54000 


CCTGGAGGGG 


TCTGAGAGGA 


AGGGAGGCAG 


54060 


GAGCTGGCAC 


CCTTCCTGGC 


CTGCGTCCTG 


54120 


GGGTCAGGCT 


GGGGCAGAGC 


CTCTGGGACG 


54180 


GCTCCTGTCC 


TCGCCCCGCC 


CTCAGCTCTC 


54240 


GCGGCAGTGC 


CTGCTGATTG 


GCTGGTGTGT 


54300 


CCTCCCCTGC 


CTTCCCTGCT 


ATCCCTGGGA 


54360 


GTGAGTGTGT 


GTGTGTGTGT 


ATGTGCACGT 


54420 


GCTGAGTCCT 


GGCGCTCGGG 


GGGCCTGGCA 


54480 


ATGGAGTCGG 


GAGCTGGGGG 


AGGCCCCATT 


54540 


TGCTTGTTAG 


AAGAGGTATC 


TGGTTCTGTC 


54600 


GTGGTTCCAG 


TTTGGGGGGC 


CTGTACTGTA 


54660 


ACTCTGACTT 


CTGAAGCCTT 


TTCTGCCCGG 


54720 


TGCCAGGGCT 


GCCATGCTGC 


AGAGGGGAGT 


54780 


TCTCCCTTAG 


CTGGAAGTTC 


TGGGCCTGTT 


54840 


TAATCACAGG 


GGCACCTGCA 


GGGGTTTGTG 


54900 


CTGAGGGGCT 


GTTCTTCCCT 


GCCCTGAGGA 


54960 


TCTTGGGCTG 


TCTCGGAGAG 


GATGCGTAGA 


55020 


TCCACCCTGA 


ACCTCAGGCC 


TTCTGGGGGC 


55080 


GGGGGCCTCT 


CTTGGGCACC 


TGAGACCCTC 


55140 


GTGGGGGACG 


CCCAGCCCTG 


CCCGCCGTGT 


55200 


TGGTCTGGGA CTTAGTGGAC 


CCTGAGGGTG 


55260 


AAAAGCCTGG 


AGCAGGAGCT 


GGCCAGCCCC 


55320 


GGAAGCAAGC 


ACAGGTGAGA 


CCCCTCAGTG 


55380 


GCTCTCCTGT 


GACCTGTGGA 


GGCCCGGATA 


55440 


AATTATGAAA 


CTAACCACCA 


TTCAGTACGA 


55500 


GTAAAAAATT 


GTCTCCCTCT 


TCCCTGGCCC 


55560 


TGGACGGAGC 


CCTGCAGGCC 


TGTGTCTGTG 


55620 


CTGTCCTGCA GTTGGCTGTT 


TTCACTCAGC 


55680 
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GTTGTGGGTA TAGCTTCCCA 
CAGATGTCTC CTATGTCTAC 
CTTATATTGA GGGGGAAACC 
CCCAGGCTGC AGTGCAGTGT 
5 AGCGATTTTC CCACCTCAGC 
TGGTTAATTT TTGTATTTTT 
CTGAGCTCGA GAGTTCCACC 
CCCCATGCCT GGCCAGCTCT 
GTCGCCCAGG CTGGAGTGCA 

10 TTCACGCCAT TCTCCTGCCT 
CGTCTGGCTA ATTTTCTGTA 
GTCTCGATCT TCTGACCTTG 
GGAGTGAGCC ACCGCGCCCG 
GTTTCTATCC AGGAATAGGG 

15 TTTACATGGT CGGAAACTCA 
CCCCATGCCT TGGTCAGAAA 
CCTGGGCCCT ACGTGCAGGA 
GCACCGTCCC CTGCATGGGG 
CCTTACTACC TGTCCCGGAA 

20 TACTTGTTGG ATGCCAAGGT 
GGTGTCCCTG GGCTTGGGAA 
CAGGCCTGGG TGGGAGGACT 
ACTTGTGAGC AGCCCCAGGC 
GGTGGGAGTG AGGAGAGACC 

25 TGACAGCTCA GGTTCTCTCC 
CCACTGACTC CCCTCTTCCC 
GTGCCCGGCT TCACACTCAC 
CCCCACAGCA TGTGCCCGGC 
ACTCCCTTCC GCCCCGTGTG 

30 CCCCACACTC AACTCCCCTC 
TGCCCTGCTT TTGTGCCCCA 
CAATGGTTTG TGTGTTCACG 
TCGCTCATTC TCGGGGTGTA 
ATAGCAAGAC TGGCTTCACT 



TGCTGGTGCT GGCAGCTCGG 
CTCTTACAGC TTCAGAGATT 
TCCCTCCCCC TTTTTTTTGA 
CACAGTCTTG GCTCACTGCA 
CTCCCAAGTA GCCGGGACTG 
TGTACAGACA GGGTCTCACT 
TGCCTTGGCC TCCCAAAGTG 

GTGGTGCGAT CTCGGCTCAC 
CAGCCTCCCG AGTAGCTGGG 
TTTTTAGTAG AGACGGGGTT 
TGATCCGCCC ACCTCGGCCT 
GGCCAGCTCT GCTTTTTCTT 
TTGGTTTTAC TTTTCCATCG 
CGAGGACTCC CCATCCCTTG 
CTCACGTGGA CTCCCATCCA 
TGAGGGCTCC TTCCGGGTCA 
TGCTCACAGG ATCTTCTCCT 
CCTGAAGCAG CAAGCCGACA 
GGGGGCTCAG TCCTGTAGCT 
CAGCTGTCCG AGCCTTTGCT 
CACCTCTGTC ACTGGGCAGG 
CACAGGTCAG TTTTCTGAGC 
TCCAGGCTGT GGTCCATAGG 
TTCACGTCAG GCCATGGGAG 
AGCGCGTGCC CGGCCCCACA 
TCCCCTCTTC CCAGTGCATG 
CTGACACTCA CTCCCCTCCT 
CCCAGCCCCA CGCTCACTCC 
CTCCCAGTGT GTGCCCGGCC 
CACTTTTTAC TTAGTGCAGG 
TGACGATGGC GTGGTGACGT 
TATTTATTGA GAGCTCATCA 
CACATGGAGC TTTGATTCTA 



CCTTGTTCTT 


TTGAGGACAG 


55740 


CAAGTTATAA 


TAAAGCTCTT 


55800 


AACAGGGTCT 


CGCTCTGCTA 


55860 


GCCTCAGCCT 


CCCAGGCTCA 


55920 


CAGGCACGCA 


CCACCATGCC 


55980 


CTGTTGCTCA 


GGCCAGTCTC 


56040 


CTGGGATTAC 


AGGCGTGAGA 


56100 


TTGAGACGGA 


GTCTCGCTCT 


56160 


TGCAAGCTCC 


GCCTCCCGAG 


56220 


ACTACAGGTG 


CCCGCCACCA 


56280 


TCACCGTGTT 


AGCCAGGATG 


56340 


CCCAAAGTGC 


TGGGATTACA 


56400 


AGTGGTTCTG 


CGTTGTGTTT 


56460 


AGTTTTTAAA 


GAGACGACGA 


56520 


GTCGGAAACT 


CACATGGACT 


56530 


TCCCAGGCAG 


CAGCTTCCCA 


56640 


GAAGACATGG 


CGGCCTCGGG 


56700 


CTCTCCTTCC 


CAGGGTGTGC 


56760 


TCATATTCAT 


GCCGTACAAT 


56820 


GACGACTCCT 


GATGTCCAGG 


56880 


GCTTCAGGGC 


CTTAGATCAG 


56940 


GGCTCAACCT 


GGCCAGACAC 


57000 


AGTCTGGGAG 


CGGGCAGGCT 


57060 


CCAGTGCCCG 


CTCTTGATCC 


57120 


GCACCGAGAA 


CACAGGAAGC 


57180 


CTCACTCCCC 


CTCCCAGCAT 


57240 


CCCGGCCCCA 


CACTCACTCC 


57300 


CCCAGTGTGT 


GCCCAGCCCC 


57360 


CCCCGCCAGC 


ATGTGCCCGG 


57420 


CTGCTGCCCT 


CCTCCCCATG 


57480 


TGGGATCACA 


CGCCACGGGT 


57540 


TTCCAGATCC 


CGTCGTTGGT 


57600 


TGCTGGGTGC 


TATTCCAGGC 


57660 


GTGGTGGGGA 


CAGGTGGACA 


57720 
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GCAAAAGAGT AAGCACGTGA GCTGACGATA CTGAAGGGAA ATAGAGCAGA GGGAGGAGGC 57780 

GGAGACCGAG CCAAGCGGGC CCAAGTGCGA TGTCGGCGGG AGGTGGGGAA TGCTGGTGGG 57840 

TCTGAGGGGA GCCTCAGCAG GTGCAGCAGA GCAAGGGAAG AGGTGAGTGG GGGCGGCTGG 57900 

GGGGCCGACT CCTGGGAAGC TGTAGCAGAA CCCCACAGAG AGCTGGTGAG GTTTGCCGTG 57960 

5 GTTGTGGGTG ACTCGGTGCT TTGAGCCCTG GCTGCCCCTG GGAACCATCT GGAGAGCTTC 58020 

TAACCCAACC AGGCCCCTCC CTGGGACAGT TATATCACAG. CTGGTAAGCC GAGTCTAACA 58080 

CTTTCACGGA AACGCAGAAG ATCTAAAACA GCAAGATGAC CGTGAAGAAG AACAGAGCTG 58140 

GAGGACTCAC CTCGCTGGTT TCAAGACTCC TCTAAAGCTG CAGGAGTGGA GGTGGAGATG 58200 

GCCCAGCTCA GGCACAGGCC TGCAGGCCAT GGAGAAGGCA GCAAGCTCAA GCTGACCCAC 58260 

10 ACGCATGTGG TCATTGTTTT TTTTTTCAGT TGGAATCTCA CTCTGTCACC CAGGTTGGAG 58320 

TGCAGTGGCA CCATCTCGGC TCACTGCAGC CCCCGCCCCT AGGTTCTAGC GATTCTCCCA 58380 

CATCAGCCTC CCGAGTAGCT GGGATTACAG GCGTGCGCCA CCATGCCTGG CCCTTGGTGA 58440 

TTGTTTTTTG ACAAACATGC CAATTTAATT GAGAGAGGAA ATGAAGGTTG ATTTCTGGTT 58500 

TTCTGAAAAA ATGGTGCTAA GAACAGCTGG ATATCTGTTC GGAAAACAGT GAATCTTAAC 58560 

15 TCTTGTTTTA CCCTGTATAA ACCTAAATGT AAAAGCTAAA CTAAAAGTTA TAGAAAGGAA 58620 

CATGGGGGAG GTCTTTGCAA CTTTGGGGTA GGCAGAGATT TCTTAGTATG GATACACAAG 58680 

GCACTAGCCA TGAAGAAAAA CATTAAAATT TAGACTTCAC CAAAATTTAA AGCTTCAACT 58740 

CTGTGGAAGA GTTGAGAAAA TGAAAAAGCA GTTAAAGAAA GGGAGAAAAT ACTTCTTTCA 58800 

AAGGACTTAA AAAATTTTTT CAGCCCTCCT CTGATTTGAA AGGACCTTTG ACCAGAGTAT 58860 

20 GTAAAATTCT CCCATAACTA AGCAAACAAC CCACTTAACC ACTGiGGAAGG GATCTGGACA 58920 

GACGTTTCAC CAAGATGGGT GGAATGGGCA GTTAACCACT GGGAGAGCAT CCGGACAGAC 58980 

GTTTCGCCAA GATGGGTGGA ATGGCCAGTT AACCACTGGG AGAGCATCCG GACAGACGTT 59040 

TCGCCAAGAT GGGTGGAATG GCCAGTTAAC CACTGGGAGA GCATCCGGAC AGACGTTTCG 59100 

CCAAGATGGG TGGAATGGCC AGTTAACCAC TGGGAGAGCA TCCGGACAGA CGTTTCGCCA 59160 

25 AGATGGGTGG AATGGCCAGT TAACCACTGG GAGAGCATCC GGACAGACGT TTCGCCAAGA 59220 

TGGGTGGAAT GGCCAGTTAA CCACTGGGAG AGCATCCGGA CAGACGTTTC GCCAAGATGG 59280 
GTGGAATGGC CAGTTAACCA CTGGGAGAGC ATCCGGACAG ACGTTTCGCC AAGATGGGTG 59340 
GAATGGCCAG TTAACCACTG GGAGAGCATC CGGACAGACG TTTCGCCAAG ATGGGTGGAA 59400 
TGGCCAGTTA ACCACTGGGA GAGCATCCGG ACAGACGTTT CGCCAAGATG GGTGGAATGG 59460 
30 CCAGTTAACC ACTGGGAGAG CATCCGGACA GACGTTTCGC CAAGATGGGT GGAATGGGCA 59520 
GTTAACCACT GGGAGAGCAT CCGGACAGAC GTTTCACCAA GGTGGATGGA ATGACCAGTT 59580 
GAGCACATGG AAAGTCGCCC AGCATCTCCA GTCATAGGAG AAGGCAGATT AAAGCCACGG 59640 
GGAGCCGACA CTGTGGTCCC ACTGGCATGG CTGAAATTCA GAAGCCCTGA GTGTGGCATG 597O0 
AGGATGTGGA ACAGCTGGAT CTCATCCATC GCTGTGAAGT TGTGTAGCCA CTCCACAAAC 59760 
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GTGTGGCAAA CAGCCGAGCC GGGAGAAGGG 
GGCTCAGTGG CTCACGCCTG TAATCCCAGA 
AAGCCCAGGA GTTTGAGACC AGCCTAGGCA 
AAAAAGAAAA AAGAAAAGAC TTCAGTGTGC 

5 AAAACTGGGA AGCAGCCCGC GTGAGCCCAT 
GAACACTAAC AGCAGCCACG GGCGTGGACT 
TCTCGGACAT GCTAACCCAG AGAGGCCCAT 
GTTTTTTGTT TTGAAATGGA GTCTCGCTCT 
CTTGGCTCAC TGCAGCTTCC GCCTCTTGGG 

10 AGTAGCTGGG ACTACAGGCA CCCGCCACCA 
GACGGCAGTT CGCCATGTTG GCCAGGCTGG 
CTTTGGCCTC CCAAAGTGCT GAGGTTGCAG 
TGTTTTATTC CATTTATGTG ACACTCTATT 
TATATGGTGC ACATAACTGC CAGAACTCAG 

15 AATTTTTTTT TTTGAGACGG GGTCTTGCTC 
TCTCCACTCA CTGCAAGCTC TGCCTCCTGG 
GAGTAGCTGG GACTACAGGC GCCCGCCACC 
ATTTTTAGTA GAGACGGGGT TTCACAGTGT 
GTGATCCGCC TGCCTCGGCC TCCGAAAGTG 

20 GGCCAATGTG TGAAAATTTA AAAGTACCAA 
ACACTCTGTG GGTGGGACCT GGGAGTTGGG 
GAAGTCTCAC TCTGTCGCCT AGGCTGGAGT 
TCTGCCTCCC AGATGAAAGC - GATTCTCCTG 
GCACACACCA CCACCCCCTG CTAATTTTTG 

25 TTGGCCAGGC TGGTCTTGAA CTCCTGACCT 
GCTGGGATTA CAGGCGTGAG CCACCGCGCC 
GAGTGGGGCT GGGGCAGGGA ACTGCTGGGT 
CTTCCTGACT GCGGTGGCCG GGGGCTCCCA 
TGGCACGTGC CTTTCCATGC TGTGGAGGAG 

30 GGTGGGGCGG TGTTGCCAGG TCATCCAGGT 
GTTGCTCATC TGCGCTTGTG ATGTTCGATG 
CCCGGCCTCT GTGAGCTCCA GCGCTGCGTC 
ACACAACATT GACCTGAAGG GGACAGTCGT 
TCTCCGCTGG CCTCCTAAAC ACCTCCTATT 
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AAGACGTGTT 


CAAAGATTCA 


TATGTGGCCA 


59820 


ACTTTAGGGG 


CCAAGGCTGG 


GGGATCGCTT 


59880 


ACATAGGGAG 


ACCCCATCTC 


AAAAAAAAAA 


59940 


AGGTTTACCA 


GAGTTTTGTT 


TGCAGTTGCC 


60000 


CCACAGGTGA 


ATGGACAGAC 


CGTGGTACCC 


60060 


GTGGTCACAC 


AGCAGCAGGG 


AGCCGATGAG 


60120 


TGAGGAGGAC 


CTACTGTTTT 


TTGTGTTTTT 


60180 


GTGGTGCAGG 


CTGGAGTGCA 


GTGGTGTGGT 


60240 


TTCAAACAGT 


TCTCCTGCCT 


CAGCCTTCCG- 


60300 


CACCCGGCTA 


ATTTTTGTAT 


TTTCAGTAGA 


60360 


TCCCAAACTC 


CTGACCTTGT 


CATCCACTCA 


60420 


GCATGAACCA 


CCGCACCCGG 


CTGGACCTAC 


60480 


AATAGAAAAG 


GCAGGGGTGG 


GGCTGGTGGT 


60540 


TACACTTAAA 


ATGAACATCT 


TAATGTGTGA 


60600 


TGTCACCCAG 


GCTAGAGTGC 


AGTGGTGCGA 


60660 


GTTCACGCCA 


TTCTCCTGCC 


TCAGCCTCCC 


60720 


ACGCCTGGCT 


AATTTTTTTT 


TTTTTTTTGT 


60780 


TCGCCAGGCT 


GGTCTCGATC 


TCCTGACCTC 


60840 


CTGGGCTTGG 


AGGCGTGAGC 


CACCATGCCC 


60900 


AGCTGGACCC 


CACCCCAGAT 


TGCTCCCATG 


60960 


TTTTGTTTTG 


TTTTGTTTTG 


TTTTTGAGAT 


61020 


GCAGTGACAC 


AATCTCGGCT 


CACATTAACC 


61080 


CCTCAGCCTT 


CTGAGTAGCT 


GGGATTACAG 


61140 


TATTTTTAGT 


AGAGACGGGG 


TTTTACCATG 


61200 


CGTGATCCGC 


CCGCCTCGGC 


CTCCCAAAGT 


61260 


TGGCTGGGAG 


TTGGGTTTGT 


AAATCTCCCT 


61320 


CTGGGTCTTC 


CTGGCTCCTC 


TGGTCTGTGG 


61380 


GGGCATCGTG 


GCCGTCTGTC 


TTGCTGAGCG 


61440 


CGTCTCCCGG 


TATGGCGAAC 


TGCTGGTTAG 


61500 


CTGGCCTCTG 


CTCTCGACAT 


CGCCGGCGCT 


61560 


CCTGCTGCAC 


ATGTCTTGGC 


TTCCCTCTTT 


61520 


CCTTCTCTTC 


CTCCTGTAGA 


GCCGCAGAGC 


61680 


GATCTTTGAC 


GAAGCTCACA 


ACGTGGTGAG 


- 61740 


GCTTCTGGCC 


TTTTTGTCAA 


GAGCCACGCA 


61800- 
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AACCTTTCTG GAGGGGCTCT 
TGAGCACACC AGGAGCTTCT 
TGGAGACTGG CCAGCTGGGC 
TCACCAGACC CTTTCCCTCC 
5 GCCCTCGGCA GCAGCAGGCT 
GGCTTGAGAC TGAGGCAGGA 
GGATCACACC ACTGCATTCC 
ATGTTGACTG GGCGCGCTAG 
GCGGATCACG AGGTCAAGAG 

iO ACTAAAAATA CAAAAAAATT 
AGGAGGCTGA GGCAGGAGAA 
TCACACCACT GTACTCCAGC 
CAAAAAGAAT AATTGGCAAT 
CTCCTTCTGT CGTCCAGGCT 

15 CTCCTGGGCT CAAGCCATCC 
CACCACCACG CCCGGCTAAT 
CAGGCTGGTC TTGAACCCCT 
GGGATTACAG GTGTGAGCCA 
GGGCCACACA GTCATGTTTG 

20 TGGGAACTTG GCTGTCAGCC 
GAACCCCCGA TCCTGTCCTG 
TGTGTGAAGA ATCGGCATCC 
TCATAGACCA GGTGCTGGAG 
AGTTCAGCGC GGACTCCCCC 

25 CCTGCGCTGC TGCAGGGTGA 
GCGTTCATAG CCAGGCTGCT 
TGTGGCTGTG TGTGGTCGGC 
CTTCTCTCAG AGGCTTCTTG 
GGTATTGTTC AGTAGTTCTG 

30 ACTTGGCCTC TACTGTGTAA 
CGCTTGAGGC CAGGAGTTTG 
AAAAATTAGC CAGGTGTGGT 
GGGAGGCGTG CTTGAGCCTG 
TCCAGCCTGG GCGGCAGAGT 



GGCCAAACTC CTGAAGCCCT 
GCCACCCCCT CCCGCCCTGA 
CAGGGACCTG CCCGTCAGGC 
AGCCAGCTCG GGGTCAGCCT 
TGTGGTCTTG CCTGCAGTGT 
GAATCGCTTG AACCTTGGAG 
AGCCTGGGTG ACAAAGCGGG 
CTCATGCCTA TAATCCCAGC 
ATCAAGACCA TCCTGGCCAA 
AGCTGGGCGT GGTGGCGTGT 
TCACTCGAAC CCAGGAGGTA 
CTGGTGACAG AGCAAGACTC 
TCCAGTGAAA TAATTGTTTG 
GGAGTTCAGT GGTATGATCT 
TCCCACCTCA GCCTCCCGAG 
TTTTGTATTT TTTGTAGAGG 
GAGCTCAAGT GATCTGCCCA 
CCGCGCCCGG CCTGAAACAA 
GACCTACTTG TGGCCTTTTA 
TCCTGTGCCT TCTGCACCCC 
TTCTGTGGTG ATTCGGGTGT 
TTTGACCTGA CTCCCCATGA 
GAGCAGACCA AGGCAGCGCA 
AGCCCAGGTG CGTTCATAGC 
GCCCCACCCG 'gAGTTCAGCA 
TGGTCCTGAG GCCCGTGCTA 
CTGGGCACCA TCTATTCAGG 
GGTGTGAATT CTTCAGGGTC 
GTATTTTCCA AAGACCTATG 
AACTGGAAAA CTCTACTTTG 
AGACCAGCCT GGGCAACATG 
GGTGTGCTCC TGTGGTCCAA 
GGAGGCAGAG CTTCCGGTGC 
GAGGCCATCT CAAAAAAAAA 



AGGTGCCCAG 


GACTGGGGAC 


61860 


TCCGATGCCT 


CTGCTGGGGC 


61920 


GCAGGGCCCC 


CACAGGCCGC 


61980 


GGGCCAGGGC 


TGTCTCCTCT 


62040 


CTCTGCCCTT 


CCGGCCACAT 


62100 


GCAGAGGCTG 


CAGTGAGCCA 


62160 


ATTCTGTGTC 


AAAAAAAAAA 


62220 


ACTTTGGGAG 


GCTGAGGTGG 


62280 


CATAGTGAAA 


CACCGTCTCT 


62340 


GCCTATAGTC 


CCAGCTACTC 


62400 


GAGGTTGCAA 


TGAGCCAAGA 


62460 


CGTCTCAAAA 


AAAATAAAAT 


62520 


TTTGTTTGTT 


GAGACAGGGT 


62580 


TGGCCCACTG 


CAACCTCCAC 


62640 


TAGCCGGGAC 


TACAGGTGCA 


62700 


CGGGGTTTCC 


CAGCGTTGCC 


62760 


CCTTGGCCTC 


CCAAAGTGCT 


62820 


TCGTTTCTAA 


ATATTGGTGT 


62880 


CAGACCCCAG 


GCCAAGGCTT 


62940 


CACCCCATTT 


CTGCTTTCTG 


63000 


GCTTGGGCTC 


TAGGAGAAGA 


63060 


CCTGGCTTCA 


GGACTGGACG 


63120 


GCAGGGTGAG 


CCCCACCCGG 


63180 


CAGACTGCTT 


GGTCCTGAGG 


63240 


CGGACTCCCC 


CAGCCCAGGT 


63300 


CTGCAGTGGG 


CAGCCTGCCC 


63360 


CTGGCACTGC 


AGGGCATCCG 


63420 


CTGTAGCCTG 


TGGAAGGGCT 


63480 


TCTTCTCCCA 


GCCAGTATCA 


63540 


TGAAGCTGAG 


TTGGGAGCAT 


63600 


GCGGAACCTC 


GCCCCTGCCA 


63660 


GCTTTTCTGG 


AGGCCGAAGT 


63720 


CCCAGATGAC 


TCCACTGCAC 


63780 


AAAAAGGAAA 


ACTAAATATA 


63840 
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TTCACTGTAA GGGCATTTTG CATCTTTAAA 
TCTGCCTGTA GGTTCCTTCC CAGTGTTTGT 
ACGGCATATG TGGAAAACGT GGAAACCCTT 
TTTCTTTTTT TTTTTTTTGA , GATGGAGTTT 
5 GCGCGATCTT GGCTCACTGC AACCTCCGCC 
CCTCCCAAGT AGCTGGGATC ACAGGCGTGC 
AGTAGAGATG GTTTCTCCGT GTTGGCCAGG 
CACCCGCTTC GGCCTCCCAA AGTGCTGGGA 
TGTCCATATT TTCTACATGG CTTCTGTAAA 

10 CATAGGTTCT GCTGTGACAC TACTTGCTCG 
TTCCCCCATT TTCCTTTAAA ATCGCACATG 
GGGGAGCCCC GTCCTCACCC TTCTCACCAG 
AAGTTGGCAT TTGTTTAGTA CAGAAGTTAT 
TATATATATA TATACATATA TTTAAGTGAC 

15 GTGGTGGTAC AATCAAAGTT CCCTGTAGCC 
TCTCAGCGTC TTAAAGCGCC GGGACCACAG 
TGCCACGCAG GGAGTTGCAG TGGGGGAAGG 
■ CTCCTCTCCA CAGTCCCCCA ACCCCACCTC 
TCCTTCGCTT GGCGCAGGGT GGGGAGAGTG 
• 20 CCATTTCATC GCTTACATAA CGTGGGAGAA 
ATCACCACGG GGCCTTCTCC ACTCTTCAGT 
TGTTGTAACT AGTGGGGGAC GTGTGGTGGG 
AACTTCAGTT TTCATCCCTA TCTGTTCCCC 
CACCCAGGCT GGAGAGCGGT GGTGCCATCA 

25 CTCCACCTCT TGGGCTCAAG TGATCCTCCT 
GGCGTGTGCC ACTGTGCTTG GCAGGGTCCA 
GCCCAGGCCG TTTACCCTGC AGAGTCGGAA 
CCCGGTTTAC TCTGAACTTA GGTGGCTTGA 
CCGGGCTCCA GATGGGACGT CCTTTCTATC 

30 CTTCCCATGT GAACATTGTT GCTGCTGAAC 
AAGATGGCAG TGTGCTAATT CCATCATCTA 
CCCTGCACCA TATCAGCTAT GTGGTGATCC 
GGGGGTGGGC CGGGTGTGCT GTCTCACACC 
AGGCGGATCA CGAGGTCAGA GATTGAGACC 
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TGACCCACAA 


ATCTGGCATG 


CATCAGCTGC 


63900 


CCAGAGGTGT 


ATTTCCACAC 


AGCGCTAGTC 


63960 


CATGGATGTT 


GTCAGTTGGT 


CTATATTTTC 


64020 


CACTTTTGTT 


GCCCAGGCTG 


GAGTGCAATG 


64080 


TCCTGGGTTC 


AAGCAATTCT 


CCTGCCTCAG 


64140 


ACCACCACGC 


CCAGCTAATT 


TTGTATTTTT 


64200 


CTGGTCTCGA 


ACTCCTGACC 


TCACGTGATC 


64260 


TTACAGGCGT 


GAGCCGCCAC 


GCCCGGCCTT 


64320 


CAGCTGACTA 


GGAGTCTGTG 


TGAATATCTT 


64380 


TGAGCATCTC 


CAGGTGTAAA 


CAGCATCAGC 


64440 


TGGACGGACA 


CCACGGGGAC 


CCTGGACCCT 


64500 


GATGGCTGCT 


TGGTAGAGAG 


TGAGTTTGCA 


64560 


CAGGTGTTCT 


GGCTTTAGAA 


TCCCTTTATA 


64620 


AGGGTCTCAC 


TCTGTTGCCC 


AGGCTGGAAT 


64680 


TCGGCCTCCT 


GGGCTCATGG 


GATCTTCCCG 


64740 


GTGTGCACCA 


CTGCCACCGG 


CTCTCAAGAT 


64800 


GGTTCCTGGG 


ACTTTGAACG 


CTCCACCTCC 


64860 


TCTAACGGGG 


TGGACGGCCG 


CCTCTTTCCA 


64920 


ACAGGTCTCC 


TTCCCTCATC 


TCGGCAGCTG 


64980 


ACATCCACCC 


ACCCCCAGGC 


CTGTGTGAAC 


65040 


TTTGTTAGTT 


ACTTGATGTG 


CAGGGCTTTT 


65100 


GTGGGCTTCT 


GCCATCTCAT 


TCAGGACCAG 


65160 


CACCCCTTTG 


GAGATGGGGT 


CTCACTCTGT 


65220 


CGGCTCACTG 


CAGCCTCCAC 


CTCCTGCAGC 


65280 


GCCTCGGCCT 


CCCAAGCTCC 


TGGGACTACA 


65340 


TTGTTTTCCT 


CACACTTTAT 


TTATTGAAGA 


65400 


TCTGTACAGG 


AGGGGCAGCC 


ACACGAGTTC 


65460 


GGGCCCCAGT 


TAGACTGCGG 


CCACCGTTTG 


65520 


AGAAGGCTCA 


CAGTATCTCC 


TTTCCCGTTT 


65580 


ACCTGAATAT 


GTTAATCACT 


GGGGGCTTGC 


65640 


GTCAGTTAGC 


AGGAATAACT 


TAGGACCACG 


65700 


CATTCACACA 


GGAAAGGTGG 


GACAAATGCT 


65760 


TGTCATCCCA 


GCACTTTGGG 


AGGCCCAGGC 


65820 


ATCCTGGCCA 


ACACGGTGAA 


ACCCCGTCTC 


65880 
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TACTAAAAAT ACAAAAAAAT TAGCCAGGTG 
TGGGAGGCTG AGGCAGGAGA ATCACTTGAA 
ATCGCACCAT TGCACTCCAG CCTGGCAACA 
TCAATCAAGT GTCATCACTG AATGTTTGTG 
5 CTCCCTCCTG AATCTCACTC CTGACCTCAG 
TCTTGTGTCC TGACCCCGGC GGTTGTGGCC 
CTCATGTCCT GCTCCAGACC TGGAATCAGC 
CTGCAAGACG. GTATTTCAAG ACCCGCCGTG 
GGTCGTTGTG ACTGGGTGTT TTCGTGGAGA 

10 TGTTAATTCT AAAGGTCAAA CACCCGGCCA 
TTTGGGAGAC CAAGACTGGT GGATCACCTG 
CAGGGTGAAA CCCGATCTCT ACTAAAAATG 
CTATAGTCCC AGCTAGTCGG GAGACAGACA 
TTGCAGTGAG CAGAGATGGC GCTGCTGCAC 

15 CTCAAAAATG AATAGATAAA TAAAGATAAA 
GATTTGGCCT GTGTAGCCTC TTCCTTCGCC 
TGTGGCCTCT TCCTTCCCCT GTTGGTGGAT 
CCTTCCCCTG TTGGTGGATT TGGCCTGCAC 
TGGTGGATTT GGCCTGCACG GATTCTGTGT, 

20 GCCTGCACGG ATTCTGTGTG GCCTCTTCCT 
TTCTGTGTGG CCTCTTCCTT CCCATGTTGG 
CTCTTCCTTT CCATGTTGGT GTCCTTTTTT 
GGGTTGTTGG CACGAGCGTG ATGCAGACTG 
AACATGGAGC TGGAAGACAT TGCAAAGCTG 

25 GCAGCTGGGT GGGGCCTCCT CCTTGCGAGG 
ATCCTGCTGC GCCTGGAGGG GGCCATCGAT 
GTCACCAAGC CAGGGAGGTG AGAGGCGGGG 
GAGCTAGAAA CGGGCCATGG TGCAGTCCTG 
CAGGCCTGTT TTCCCTTTTT GAGACCTGGG 

30 TGAGATGTTG AGAGGAGCAG CACACACTCC 
GGCACACACA CCCATGGAAC AGCACACACA 
ACAGCACACA CACTCCCACG GAACAGCACA 
GAACAGCACA CACACTCCCA CGGAACAGCA 
CGGAACAGCA CACTCTCCCA CGGAACAGCA 
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TGGTGGTGCA 


TGCTTGTAAT 


CCCAGCTACT 


65940 


CCCAGGAGGC 


GGAGGTTGCA 


GTGAGCCGAG 


66000 


GAGCGAGACT 


CCGTCTCAAA 


AATCAATCAG 


66060 


TGTGAACGTG 


GGGATTGGTC 


CTGCCCCATG 


66120 


TTGCTGCACC 


TTGAGGTGTT 


TTCTGTGGGC 


66180 


TCTGCTGTCT 


GGGAGTCAGG 


ATTTTTCACA 


66240 


CAAGTCTCCA 


AGAAGCCCTG 


CTTTCTTTTC 


66300 


CGGCAGCGGG 


TTGGTCATGG 


TTACTGGGTT 


66360 


TACAGCCATA 


CGCACAGGTG 


TGTTCACAAA 


66420 


GGCATAAGGG 


CTCAGCGGTA 


ATCCCAGCAC 


66480 


AGGTCAGGAG 


TTTAAGACCA 


GCCTGAGCAA 


66540 


CGAAAATTAG 


CCGGGCATGG 


TGGCGCACAC 


66600 


CGAGAATTGC 


TTGAACCTGG 


GACATGGAGG 


66660 


CCCTGCCTGG 


GTGACAGAGT 


GACACCCTGT 


66720 


ACACCTGCTC 


CTCTTGGTGT 


CTCCAGTTTG 


66780 


TGTTGGTGGA 


TTTGGCCTGC 


ACGGATTCTG 


66840 


TTGGCCTGCA 


CGGATTCTGT 


GTGGCCTCTT 


66900 


GGATTCTGTG 


TGGCCTCTTC 


CTTCCCCTGT 


66960 


GGCCTCTTCC 


TTCCCCTGTT 


GGTGGATTTG 


67020 


TCCCCTGTTG 


GTGGATTTGG 


CCTGCACGGA 


67080 


TGGATTTGGC 


CTGCATGGAT 


TCTGTGTGGC 


67140 


CCATGCCAGG 


AATCCTGGTT 


CTCAAGGGCG 


67200 


CCTTTGCTGC 


CTTTCTCTTG 


CCCAGGGCTG 


67260 


AAGAGTAAGT 


GTTGCCCTCC 


CCGCCTCCTT 


67320 


AGGTGGGTGA 


CACCTCCTCG 


ACCCACAGTG 


67380 


GCTGTTGAGC 


TGCCTGGAGA 


CGACAGCGGT 


67440 


AGCCAGCCCC 


TTCACTGCAG 


GCCCAGCCTA 


67500 


GGCTGTCACA 


TCACGAGTGA 


GGCCTGTTTT 


67560 


AGGAGCACCT 


GCTTTGCATG 


ATCTGGTTGC 


67620 


CACGGGACAG CACACAGCCC 


CCCACGGAAC 


67680 


CTCCCACGAA CAGCACACAC 


ACTCCCACGA 


67740 


CACACCCACG 


GAACGGCACA 


CACACCCACG 


67800 


CACACACCCA CGGAACGGCA CACACTCCCA 


67860 


CACTCTCCCA CGGAACAGCA 


CACACACTCC 


67920 
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CACGGAACAG 


CACACACACC 


CACGGAACGG 


CACACACTCC 


CACGGAACAG 


CAGACTCTCC 


67980 




CACGGAACAG 


CACACACACT 


CCCACAGACA 


GCACACACAC 


ACCCACGGAA 


CAGCACACTC 


68040 




TCCCACGCGG 


GGCCGCTGGG 


TTTCCTGCAG 


TTTCTCCTCC 


TCCAGGCCTT 


TCCCTGGACC 


68100 




CTGGTCCAGT 


CCGTCATTTG 


AGCACAGGTG 


CCTGTTAGAA 


CGAGACCTTC 


TTGTTAGGAC 


68160 


5 


GATGAGTGTC 


CCAGCCACCA 


CCTCTTTTGG 


ACTCCGGGAG 


GCCTGGAACG 


TTCTGAACGC 


68220 




TCCGTGGGGC 


TCCAGTCTTC 


TCCGCAGCCA 


GGGCAGCAGG 


GTTTGCTGTC 


TGTCCTGCAG 


68280 




GCAGATGAGG 


AGTCAGGGCT 


GGGGCCTGTG. 


TGGGGGCTCT 


CCTGAGCGCG 


CAGCCGCCGA 


68340 




GGTGGAGCGT 


GTTCTGCCTG 


AGCGCCGACC 


TGGTCGGGGG 


AATCCCAGTT 


GCTTCCAGGT 


68400 




GGAGCCACTG 


TCCTCAGCGT 


AATGCTCAAG 


GCTCTGGCCT 


GGCTCCTCGG 


CCACCCTGCA 


68460 


10 


CCCTCAGGGT 


CCCCTCCTGT 


AGCTTCTGCT 


GCCCCATCAC 


TGTCACTCTC 


CAAAGCTTTG 


68520 




GGGACTCTGC 


CCAGAGCCAC 


CGCCTCCCAG 


AAGCCCCTGA 


CAACCTCTTG 


ACGACCCCCT 


68580 




AGTGACCCCA 


TCCCTCCCCT 


CTGACGGCGG 


CCCCTGCTCT 


GAGGCGGCTT 


CTTTTCCTCG 


68640 




GTGCTGTTCT 


CGTGCTGGCC 


AGGCCTCCTC 


TCCCCACCTG 


GAGGCTCCTG 


AGGGCGGAGG 


68700 




CCTCTCACCT 


CCAATGCTGG 


CGTCCCCTGG 


AGGGCTGAAT 


TTGTTTCGGA 


GGGAAGGAAA 


68760 


15 


CTTCCACAGT 


TGTTGCCTTC 


AGTTCCAAAG 


CTGCAGCCTG 


ATTTCCCCCT 


CCAGGCTCGA 


68820 




GCCTGTTTTC 


TTCTCGGCAG 


CTACATCTTT 


GACCAGTGTC 


GTCCCCCCTC 


AGGCCCGAGC 


68880 




CTGCCTTCTT 


CTCCTCAGTT 


CCCAAAGCTG 


CAGTCTGGTC 


CCCCCGCCAG 


GCTCGAGCCT 


68940 




GCCTTCTTCT 


CCTCGGCAGC 


TACATCTTTG 


AGCTGTTTGC 


TGAAGCCCAG 


ATCACGTTTC 


69000 




AGACCAAGGG 


CTGCATCCTG 


GACTCGCTGG 


ACCAGATCAT 


CCAGCACCTG 


GCAGGACGTG 


69060 


20 


AGTGCTGGCA 


CGGGGTCTTT 


GGTGCGGGCA 


AATGTGGCGT 


AGGGGGTGCA 


GCAGGCCTCC 


69120 




ATCTTGGCAG 


TCAGGGCTCC 


CCTGGCCGTC 


ACCTGGCCGT 


CAGCAGGAAC 


AGGCCCACAG 


69180 




AACCTCATCT 


TCTGATCGGG 


GCGTGGAGGC 


GTTAGTGCCA 


CTTGCCAGCT 


GCCGTAGAGC 


69240 




CTGTCCCAGT 


TCTGCAGCTG 


GCGGCTTCGT 


CCTACAGCCT 


CATCCCATTA 


TTCTGCTTTT 


69300 




GAGAAAGAGC 


AGCCCAAGGC 


CCTAGCTGGC 


TTGTGGGGCC 


TCTGGCTTCT 


CCACACCACC 


69360 


25 


CCGAGTTCTG 


CTTCTCAGAG 


TTGTGGGGTC 


CAGAGGCTTT 


GCCCAGAGGC 


GGTGTCCCCA 


69420 




TGGGCTGCTC 


TGGTTTGAGA 


CGCCGGGCCC 


AGCGGGGTCT 


CTCCTCTGCT 


GCGCTCCCGG 


69480 




GTGCTGGGGA 


GGGTGGCTTT 


TGCTGCTTCA 


ACCCTTAGGC 


GACCATAGAG 


CCTCTTTTCA 


69540 




AGTCCCACTG 


ACCCCCTTGG 


AGACTCTGTC 


CCTGCCTGGC 


TTCTCTCCTG 


GCTGCTGGGA 


69600 




AGAGCAGGCG 


AACTGCCCGC 


CCTGAATGGA 


TGCTGCGCTC 


CACCCTGGGC 


CCCCCATTGG 


69660 


30 


GCAGGAGATG 


GAGCTTGGCA 


GTCGGGCTGA 


GCGGGCTCAT 


GCTGGAAGGG 


CGGGGGCTGG 


69720 




GGTCGGGGCC 


TCCCCTGCCT 


GCAGTGTGGG 


TGTCAGCGCC 


CTGCTGCCCT 


CCAGGTGCTG 


69780 




GAGTGTTCAC 


CAACACGGCC 


GGACTGCAGA 


AGCTGGCGGA 


CATTATCCAG 


GTGGGGCCTG 


69840 




CTCCTCTGTG 


GCATCTCCTT 


CCCTGATGGA 


AGCCGGGCGG 


GTGCCTTCTC 


CTGCTGTATT 


69900 




AGTTAACTGA 


TTCTAGACTT 


GGGGATGGGA 


GAAAGGCCCC 


TACACCACCT 


GTTTCTGATT 


69960 
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GGCAAACTCT 


CGGCTCCTTT 


CCAGTGCCCT 


AAACCCACAC 


TGGGCCTCCT 


GCAGGGATGG 


70020 


GGGAGGACGA 


GGTCTGGTGG 


CACATGCCCA 


GGGTGATGCT 


GGTGAGGGAG 


GACGCAAAGG 


70080 


ACAGTGGGGG 


CCGGGGAGCC 


GCTCCTGCCC 


TGTCCGGGCC 


CTCAGGCCAG 


GGGGGACCCA 


70140 


CTGCTGGCAG 


CCCCAGCAGC 


CCCAGCTGCA 


CGCAGATGAA 


GAGCTCTGGA 


CACACGCGGC 


70200 


TTCCTGAACA 


GCTTCTCCAG 


GGACAGACAA 


ATGGGGACCC 


TGCAGGTTCC 


CGGCAGGGGT 


70260 


GTCCCTGGGA 


GCCCATGATT 


GGGGGTGCGA 


CCCTGGCCCC 


CTTCTCATTG 


GCCCCGTCCT 


70320 


GTCCTGCAAT 


GCCCGTCCCA 


TGTGAGGTCT 


GCTTCTGGCT 


CCATGCCTAT 


GGCAGCACCT 


70380 


GCTTTCCCTG 


GCGTAGAGGT 


GCTTGTCCGG 


TTTGTGGAGG 


GCACGCCCCA 


TTTTGGGTGC 


70440 


TCTGGGCACG 


TTGCCTCTCC 


GGGGCCTCGG 


TGGCTTTTTT 


AGAAGCAGAC 


TCAGAAGTCC 


70500 


CTGACTGGGG 


AAGCCAAGGC 


ACAGGTGGCT 


GTGTGGAGCC 


CTGTGAGGCC 


TCCTCTGTGC 


70560 


TGCCCACGCT 


GTACCTGCTG 


GCCACACGAG 


ATCATGGCAG 


GGTTAGGCAG 


GGCTGCCCAG 


70620 


CGCTATGACA 


GCTTCATGAG 


TGTCCATCTG 


GCCTGTGGGG 


TGCTTGAGCT 


GGGGGAGGCC 


70680' 


GCAGAAGAAC 


CCTGGGATGC 


ATGGCTGGCC 


TGTGCATGCT 


GCTGGGCATG 


GAGCTGCAGA 


70740 


TCCCGGAACA 


AGCAGGCACT 


GCCTTCTCCT 


TCACAGACGC 


AGCTCTGAGC 


GGGGGCGAGA 


70800 


CCTGGGCAGG 


GACCAGGTGG 


GGTGGGCACA 


GGGTGGTGGG 


GCCCAGGCTC 


AGCCCTCCCT 


70860 


CCACTGTGGC 


CGTCTCTGTG 


GCCAGTGACG 


CCACAGCCTG 


TGTCTTCTCT 


GTGCGGTAGC 


70920 


TGGGGCTGGA 


AGGACAGCAC 


TGCCTTGTCC 


TCCCAACTCC 


TCCCCAAAGG 


CACGGTGGGC 


70980 


ATCCCAGGCC 


CAGACCCCTC 


TGTCTGTGGC 


TCCTGCCTGC 


CAAGGGCTGC 


TGTGCTGTCC 


71040 


CGCATGGAGT 


GTGGTTGGCT 


CTTCAAGCAG 


GAGGCCGTGC 


ACCTATCAGG 


CGGACCTGCT 


71100 


TCCATGTCCC 


TGATGGGTCA 


CTGCAAAGCA 


CCTCCAGCAC 


ATGGCCAGGC 


GAGGTAGCCC 


71160 


TGCAGCCCAG 


GGCCTGGAGG 


GCAGGTGTGA 


GCTGGCCCGG 


GCCTGTCCCT 


CCCTGGAATA 


71220 


CAGCTTCCCA 


GGCTCCCACT 


TATGGAGAAG 


TCTCCTCCAC 


ACTATGGAAC 


TGAATCCTAG 


71280 


AATGTGGCTT 


CTGAGGTTCC 


TACACTCGAA 


CTGAATCCTG 


GAATGCGGCT 


TCCAAGGCTT 


71340 


CCAGCTATGG 


AGAAGACTCC 


ACACTCTGGA 


ACCGAATCCT 


GGAACGCGGC 


CTCCCAGGCC 


71400 


CCCAGCTATG 


GAGAAGACTC 


CACACTCTGG 


AACCGAATCC 


TGGAACGCGG 


CCTCCCAGGC 


71460 


CCCCAGCTAT 


GGAGAAGACT 


CCACACTCTG 


GAACCGGATC 


CTGGAACGCG 


GCCTCCCAGC 


71520 


CTCCCACTTA 


AGGAGAAGTC 


TCCACACTCT 


GGAACCGGAT 


CCTGGAACGT 


GGCCTCCCAG 


71580 


GCCCCCACTT 


AAGGAGAAGA 


CTCCACACTC 


TGGAACCGAA 


TCCTGCACAC 


TCCATCGGTT 


71640 


TGGAATTTCC 


TTTGGCTGCT 


GCTCTAAGTA 


GCCGCTGGTG 


GATGACTCAG 


CTTCTGCCAG 


71700 


CCCTCGGGTG 


CCTGGAGGAT 


GAGGGACTGC 


ACACAGTGCT 


CACCCGCGTT 


GGCTCCTGAG 


71760 


CCCCTGCAGG 


TGTGGGCGGT 


GCCCATAGGG 


CTGGTGCTGG 


GTTGGGCCTG 


CAGCCCTGAG 


71820 


TCACAGGTGA 


CCCTGGGGGC 


AGAGTGGGGC 


CAGTGGCCCC 


AGGAAGAGGA 


TGTGGGATGC 


71880 


ACAGCTCAGC 


TGGAGGCGAA 


CTCCAGGCAG 


GGTCAGGCCG 


TGTGCTCGGA 


AGTCAGGGCT 


71940 


TAGCTGGAGG 


CAAACTCTGG 


GCAGTGCTGG 


CCCGTGTTGG 


GGAACCAGTT 


GCCCCTGGGC 


72000 
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CCCCGTGAGA 


CTGCTGGGTC 


CTCATCCCTC 


TCTGCCTGAG 


GCCGGAGCTG 


CCCTGGGCTG 


72060 


AGGCACAGGG 


GGATTTGTGG 


TGGTGTTTTT 


TTGAGAAAGG 


GTCTCGCTTT 


GTCACCCCGG 


72120 


CTGGAGTGCA 


GGGGCTTGAT 


CACAGCTCAC 


TGCAGCCTCA 


ACCTCCTGGG 


CCCAAGTGAT 


72180 


CCTCTTGCCT 


CAGCCACCCG 


AGGAGCTGTG 


AACACAGGTG 


TGCACCACCG 


CACTCAGCTA 


72240 


ATTTTTAAAA 


TTTTTTTGTA 


GAGATGAGGT 


CTTGCCATGT 


TTCCCAGGCT 


GGTCTCAAAC 


72300 


TCCTGGGCTC 


AGGCAGTCTG 


CCCGCCTTGG 


CCTCCCAAAG 


TGCTGGGATT 


ACAGGCAAGA 


72360 


GCTTCCATGC 


CTGCCCAGCA 


GAAGGCTTTT 


CGAAGGAAGC 


TGTTTCCTGA 


GGCAGACTCA 


72420 


GCCCTGCTCA 


TGGCAGCCAC 


CAGCGTGGGG 


GTGAACTTGT 


TCTGTTACTT 


CCATCCCCGT 


72480 


GGGCCAAATG 


CTTTGGTAAA 


ACACAAGGCC 


CTGTGTTTAG 


CTGTCTTGAC 


AGTGAAAATG 


72540 


GCTGGGAAGG 


AAGGAAGGAA 


CGGAAGGAAA 


TTTCTCTCTC 


CTTCTGTGCG 


TACCCAGGCA 


72600 


CGTGCACATG 


CATGCAGAGT 


ACGCACACAC 


GCACGCACGC 


CTGCACAAAT 


CCACGCATGT 


72660 


TGCCAAGTCT 


CTGTGTTCCA 


GCCGTGGTGT 


CTGCCCCCCG 


GTGTTCTCTA 


GTTCGGCTTC 


72720 


TCCGCATTTC 


TGTGAATGAT 


TCCGGCTTCT 


TGGTGTTCCC 


AGCAGAACTC 


CCTCAAGTCT 


72780 


GCGGCGGGGC 


TCTGACGGCG 


GTGGCTTGGC 


TGACATGGCC 


ACATTGCTGA 


GCCTGTTGGG 


72840 


GGCTTTGCGT 


TCCTGTTCTG 


GCCGTTTTTG 


GCTCGTTTTC 


CAGGAACGGT 


CGTCACGCGC 


72900 


TCCTCTCCTA 


GTGCAGGCAT 


CATTCCTTTC 


CCATTGATTT 


GCAGGGTTCT 


CTGTAAGTTC 


72960 


TGAGGATCCC 


ATATACATAT 


ACTCTCTGTA 


AGTTCTGAGG 


ATCCCATATA 


CATATTCTCT 


73020 


CTCTAAGTTC 


TGAGGATCCC 


ATATACATAT 


TCTCTCTCTA 


AGTTCTGAGG 


ATCCCATGCC 


73080 


GACATACATA 


TTCTTTCCTT 


GTCTCATGCT 


GGTCATTTTT 


TCCATTTTCA 


TGACAGGTTT 


73140 


GGTGAACACA 


TGTTTCCTTG 


TCAGATTTTT 


GTTCTGAGCT 


TGTGCCTCCC 


GACCAAGATG 


73200 


CTAAACCGGG 


TCTTGTGTAT 


TCTCCAAACT 


GCACTGTAGA 


GTGACGGAGC 


TTTGTGTCTG 


73260 


GGCCTCCATG 


CCTTCTGACG 


TCACCTGTGG 


GGGTGTGAAA 


GGCAGACTCT 


ACCTTGATTT 


73320 


TTCCCAGCAC 


GCCACACCGG 


TGGTTCTGTG 


CGCTGACCGA 


GCGGCTCGGC 


TTCCCCCAAC . 


73380 


TCCACTGGGC 


ACCTGCCACA 


CTTTTCCTCA 


TGTTTTTGTT 


CACTGTGGTT 


TTGTCGTAAG 


73440 


TCCTGGTGTT 


GGCCTGAACC 


AATTTCTTTT 


TGTTTGTTTT 


TGAGACAGAG 


TTTTGCTCTT 


73500 


GTTGCCCAGG 


CTGGAGTGCA 


GTGGCGCGAT 


CTCGGCTCAC 


TGCAAGCTCC 


GCCTCCCGGG 


73560 


TTCACGCCAT 


TCTCCTGCCT 


CAGCCTCCCA 


AATACCTGGG 


ATTATAGGCA 


CCTGCCACCA 


73620 


CGCCTGGCTA 


ATTTTTTGTA 


TTTTTAGTAG 


AGACGAGGTT 


TCACCGTGTT 


AGCCAGGATG 


73680 


GTCTCGATCT 


CCTGACCTCG 


TGATCCGCCT 


CCCAAAGTGC 


TGGGATTACA 


GGCATGAGCC 


73740 


ACCGTGCCCA 


GCCTGATATT 


TTTAGTAGAA 


ATGGGGTTTT 


GCCATGTTGG 


CCAGGCTGGT 


73800 


CTCGAACTCC 


TGACCTCAGG 


TGATCCTCTC 


ACCTTGGCCT 


CCCAGAGTGC 


TGGGATTACG 


73860 


GGTGTGAGCC 


ACCACGCCCG 


GCCTCTTGTT 


CTTTTGAAAC 


CTGCCCTGAC 


GTTTTTTCCA 


73920 


TAGTGCATCT 


TGGAGTCAGC 


GTGTCTACTT 


CCTGTAAAAA 


TCTTACTGTG 


ATTTTGACTA 


73980 


GAATGTGTTG 


AATTCCTGTT 




GTCAGGGTCT 


CTCTGTTGCC 


CAGGCTGGAG 


74040 
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TGCAGTGGGA CCATCACAGC TCACTGCAGC 
AGCTCAACCT CCCAAGTAGC TGGGACCACA 
rprp^YTTTTTT TTTTTGGTGA ACACCCTGGG 
ACTCCTGGGT TCGGGCAGTT TGCTCCTCTC 

5 GAGCCACTGC ACTAGGCCAT GTTGAATTTC 
AGAGGAGGGC TGGGCCAGTT GGCGGGAGGA 
CATGGAGATG GCCTATGTTG GGGGAACACA 
AAAGGGTCAG GCAGGTGCCA GCTCCTGTCC 
CGGTGAACCG ATGACCCCTG GGTGTCCTGT 

10 AAAGTCGGGT GGCCTCAGGC TCCTGACGGG 
TCAGTGTGGA CCCCTCCGAG GGCAGCCCTG 
CCTATAAGGT AGGGGCCACC TCCAGGAGGC 
GGCTGGGGGC CTTACAGTCC TATAAGGTGG 
GGGTCTTCTG GTCCTAAAAG GTAAGGGGCT 

15 TCCTGGTCCT GTCCCCTCCA GGTGCACATC 
CAGCGGTCTG ATGCCTGGAG CACCACTGCA 
CAGGCTCCCC GCCCGCCCGG GTGCAGTGCC 
GCCCAGCCGT GGATCTCCTG CCCCCATGGG 
CCAGAGCATT CTCCTCACTG TCCCTCTGAA 

20 GCCTGATGGC ACCGTGACCT GCCCAGAGTC 
AGCCAGGCCT CCCTGCTGGG ACACGCTCGT 
ACAGTGAGGC ACGGCTCTGC CTGCTGCCTC 
GGTCACATCT GAAGGGGGCC CGGCTGGCCC 
CCGAGGCCCC ACGTGCCTTC TTGGCTTCCT 

25 CCGGGGACGC TGTGTGGATG GGGTCTGTGA 
GTGGTTCTTG GGGTGTATCT GGCCCCACCC 
CGGCTGCAGC TGCGGGAGCT GCTGCCCCTG 
GCTTGGTTCG TCCCTCTCTG GAACATTCTT 
GACGTGGCCC TGGCTGAAGC AGCCCTTGTG 

30 GCCGGAAGGG AAGGGCTGGG AGGGCGTCAG 
GGTCCTGCAG CAAGGCCTCT CCCAGCAGGA 
CTCACGCAGT TGGGGTTGCG GGAGGCAGTG 
AAGGGCCTGG AATCTGTTTG TTCCATTGAC 
CCACTCCCTG CACTGGGCGT TTGTAGGAAA 
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CTCAACCTCC 


TGGGCTCAGG 


GGATCCTCTC 


74100 


GGCACATGCC 


ACCATGCCCG 


GCTAGGTTTT 


74160 


GTTGCACCAT 


GTTGCCCAGG 


CTGGTCTCGA 


74220 


AGCCTCCCGG 


AGTGCTGGGA 


TTACAGGCCT 


74280 


TAGATTAATT 


TGGGGCCCTC 


AGGGGCACAG 


74340 


GAGGCCCCTC 


GGGCTGCCGC 


ATTTTCAGTG 


74400 


GAGCTCACCG 


GGGGTCCCTG 


CAGGGAGGAG 


74460 


ATTGGCCTGG 


GGCTGCATGA 


TGGCAGGGGC 


74520 


GACCTTCTGT 


GTATGCGGCT 


GATGCTGCAG 


74580 


GCTGCACTTC 


CTCTGCCTTT 


CAGATTGTGT 


74640 


GTTCCCCAGC 


AGGGCTGGGG 


GCCTTACAGT 


74700 


AGGTGGAGGG 


CAGCCCTTGT 


TCCCCGGCAG 


74760 


GGGCCACCTC 


CAGGAGGCAG 


GTGGGGCTGG 


74820 


GCCCCCAGGA 


CATGGGC GGG 


GCCTCCACAC 


74880 


CATCCTGATG 


CTGGTCACCG 


GAGGACGGCT 


74940 


GCCAGAAAGC 


GAGGTACAGA 


CCTGGGCCCA 


75000 


CGGCACCACC 


ATGCCACAGG 


CTAGGCACAT 


75060 


CCTGGCCACC 


TTCTCCATAT 


CCAGGCCAAT 


75120 


GATTGGAGTT 


ACTGAGAGAC 


GTAGGAGATG 


75180 


ACCTGGTTGG 


TGGTGGCAGA 


GCCACAGCCC 


75240 


TTATGCCGAG 


GCCGTCAGCA 


CAGAGCCTCC 


75300 


CACGCAGCGC 


CTGGCCGGGC 


CAAGCCTCAG 


75360 


TGTTGTCCGA 


AGCCCCTGGT 


GCGCTCAGCC 


75420 


GTGCTCCGTG 


GCGTCTTCGA 


GTCGGTGCTG 


75480 


GTGTGCCCTC 


GGCTCCGTGT 


CCGGAGCCCT 


75540 


CCACTGCGTG 


GTGTCCAGGG 


TGGGGCTTCA 


75600 


CCTTGTGCTC 


CAGTGGGGCC 


TTGCCTCTGG 


75660 


TCTCAGCTGC 


TGTCCGACCC 


ATGGTGGCAT 


75720 


CGGTTGCTGT 


GGTTGGGTCT 


GCCTGGCCGA 


75780 


GGTGGCGTGG 


CTTGACCCCC 


GCTCGGTGAT 


75840 


AGCGTCCATC 


CCGGGGGGAG 


GCCGGCGCCC 


75900 


CGTGCCTGAG 


GCAGCCGGTG 


CACAGATTCC 


75960 


CTCTGATGTC 


ACTTGACTTC 


TCAGAAGCAG 


76020 


TGAGCTCCTG 


GAGGAGGGGG 


TGGGGAAGTT 


76080 
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CCCCCATTGC AGGGCACACT CAGCCCCAGG 
GAATCGCAGT CAGAGTCGTT CTGCTTGTGC 
ACTCAGCCTC CTCCCCAGGC CACGGCCGCC 
TTCCTTTGAG CTGGCGCCCT TGTTCACTGC 

5 CTGCTTCCAG GCAGAGCTGG GAACTCGCAC 
GGGATGCCGG CGCTTCCTTC CCGTGTGCTC 
CATGTCACAG TTTCTGCAGA AGTTTAGGGT 
CCAGCCTCTG CCTGGGGTCT GCCTCCTACT 
CAAGCTGGCA GGCTCACACT CAGGGCAGCC 

10 TGGGTGGCTG. CCCGCGGCTC GCTTGCTTGA 
GAGGCAGGTG GTCAGGCCCC CAGTCCCGTC 
CCCCACAGGG AAGGTGCTGA GCTACTGGTG 
GGTCCGCCAG GGCGTCCGCT CCCTCATCCT 
CTTTGCTCTG GAGATGCAGA TGTACGGGCC 

15 CCTCTGACAT CAGCGGGGTG GAAGTGGTGG 
CGGGCCTCGA GGGCTAAAGG GGTGCTGGTG 
CGCTCAGCCC TTTCCCAGTC TGCCTGGAGA 
GGGTGGGGGT CGTCCCCAGA GGCCCCGATG 
GGTGAGGGCC TGTCCCTGGG CCCTGCTGGG 

20 GCCCTGAGCA GTGGCCTCTC CGGCTCTAGG 
AAGGCTCTGG GTGAGTGCCC TGAATGCCCC 
GCTCCCAAGA GCTGGTAGGG ACCCCTGCAG 
TGCACTTCCA GGCAACATCG CCCGCGTGGT 
CTATCCTGTC ATGGAGAAGA GCCTGGAGTT 

25 CGGGGTGGGT GAGGGCAGGG CTGGAGCATG 
CCTCATCGGA TCGGCGGCGT GACCAGGGCT 
CCGCGACTTG GCCAGGAAGA TGGAGGCGCT 
AGGCAGCTTC TCCGAGGTCG GCACTTGGCC 
TCCCCTGCCT CTCACAGCTT CCCCAAGGCT 

30 GCTCCCGCTG CCTCTTCAGG GCTCCTGCGT 
TCTTACAAAG CCCCCAGCAC CGGGTGGGTG 
CCTAGTCGGC CACCCTGGCC CTGGGGTTCC 
TGGGGTCAGC CTGGGATCCG TGCCAGGAAG 
CTAGGACGAT CCTTCATCTT GGAGCATGAG 
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AAGGAAACGT 


GCCTCGTCCC 


TGCTGACTCC 


76140 


CGTGTTGAAT 


TCCCGGCATC 


CGGCATCCAG 


76200 


GTGGCCAGTC 


GGTCAAGCCC 


TTCTAGGAAC 


76260 


TGACGCCACT 


CAGAGGCTTG 


TGCACGTGTC 


76320 


CCCGTCTTCT 


GCACGCGGCC 


GTGGAATGTC 


76380 


TTGGCGGGGT 


GGGCTTCTTG 


CCCTGAGCCG 


76440 


TGGAGTGGGC 


TGACCTCTCT 


GCAGGTGTCC 


76500 


CCCAGGACCC 


CCTGTCCCCC 


AGAGGGGCCC 


76560 


TCCTTTGTTC TGACTTCTGC 


ACAGTGGGCC 


76620 


TGCCAGTGGG 


TGGAGAGGGT 


GATGGGCAGA 


76680 


CTCACACTCT 


GTGCCCTCTG 


CCGCCCCCCG 


76740 


CTTCAGTGCC 


GGCCACAGCA 


TGCACGAGCT 


76800 


TACCAGCGGC 


ACGCTGGCCC 


CGGTGTCCTC 


76860 


ACCCCTGCCA 


GGGCCTGAGC 


ACCGGTGACA 


76920 


GGGTCCCCAT 


GAGCCGGGTG 


CTGGGGGTCT 


76980 


CACTTCCCCA 


CTGTCTGCTC 


CCTCTGGCCA 


77040 


ACCCACACAT 


CATCGACAAG 


CACCAGATCT 


77100 


GAGCCCAGTT 


GAGCTCCGCG 


TTTGACAGAC 


77160 


GTGGGAGGTG 


GGGGAGCACT 


GAGGCCTGAG 


77220 


TTTTCCGAGG 


AGTGCTTATC 


CTCCCTGGGG 


77280 


AGCTGTGCGC 


ATCCTGGATC 


CTGGACCCCT 


77340 


ACATCCTGCC 


CCTGCCTTGA 


CCCCGGCCCC 


77400 


GCCCTATGGG 


CTCCTGATCT 


TCTTCCCTTC 


77460 


CTGGCGGGTG 


CGTCTCCCCT 


GTGTTCTGGG 


77520 


AAGCAGGCAG 


TGGTCACAGC 


TCCTGCTTGC 


77580 


GCCGTGTCCC 


TGCCTCTTCC 


TCCCACAGGC 


77640 


GAAGCCGCTG 


TTTGTGGAGC 


CCAGGAGCAA 


77700 


GGGGCTCTGG 


GCCTGCTGCC 


CCCTCGTGCC 


77760 


GACCACTGGC 


CCTGACCATG 


GGCTCCGGCG 


77820 


TTCCTTCCTG 


GCCCTGAGTG 


TTGCCTCTTA 


77880 


TGGTAACAGT 


GGCCCTCCTG 


TCTGAGTAGC 


77940 


CCGTGTTTTC 


TGGGAAGCAC 


TGAGCAGGCG 


78000 


AAGCTTCCAG 


AACCCGATTG 


GCCTTCCTGG 


78060 


ACCTGGGTCT 


CCCTCATGGG GGAGGAAGGG 


78120 
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GCTGGGGGGG GGCTCCAGGC TCAGCCTCAC 
TATGCAAGGG TTGCCGCCCC TGGGTCCACC 
AAGGTGAGCT CTCCAGGGCC CTCTGCCCTG 
ATGGCTCCCC AGCAGACTCT GGGCCCTGGG 
5 AGAGCGACTG CTGGCCCTGC TGGGAGCGTG 
GAGTTTCCTG GCCACAAGAG TTGGAGGTGG 
CCTGACCCAC AGATGGAGCT TCCTCCCACC 
GGCTGGGCTG GGTGTGGGCA CCAGGGAGAG 
TGTGTCCCCT AGGCCAGCGA GGGGCTGGAC 

10 GTCACGGGCC TCCCGTACCC CCCACGCATG 
CTGGATGAGA TGAAGGGCCA GGGTGGGGCT 
GTGGGGTAAG GCGGTCTGGT GACTGAGCCC 
CGAGCCGCCT CGCCCCACAG TTCCTCTCTG 
GGGCTGTGAA CCAGGCCATC GGGCGAGTGA 

15 TCCTCTGTGA CCACAGGTGC GTGCAGTCCG 
ACACCCCACT GGGCCCCTGG ACTCTCCTTC 
TCTCCGGCTA CTCGGGGTCA GCGTGGGGCC 
GAACTGAACC CTTGAAGCGG CTGTGGGCAG 
CGCCTTTGCC GACGCAAGAG CCCAACTGCC 

20 TGACAACTTT GGCCATGTCA TCCGAGACGT 
TGTGAGTTCC TGCCCAGGGA GGGGATGAGG 
CCACCCACCC CCATGGTTCT GCAGATGCCA 
GTGCGTGGAG AAGATGCTGT CAGCGAGGCC 
AAAGCTAAGA GTCTGGACCT GCATGTCCCC 

25 GGGCAGCGCT GGGTGGGCGG TGTGGGGGTG 
CAGGCGCCCA GGGCGGAGGC GACTCACCTG 
GGCTGCCTCT CCCTCCTAGG GCAGGGCCCC 
GAACGCCCCA GGCAAGGATG CCCCCCGAGG 
CCCACAGGCA GGACCCCAAG TGCTGGGACT 

30 GGACCCCAGC TGGGGCCCCC CGTGGGCTTC 
CCAGCTGCCG GGGACCCCGA GAGTAGCCTG 
GCCCGGCAGA GGCCCAGGGG GCTGCTGGCC 
AGCCCTGGCG AGGAGCAGGT ACAGTTCCAG 
CCTGAGGCCA ACCCGACCCC GCCCATCTGG 
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CAACTTTCCT 


TCCAGACCAT 


CAGTGCTTAC 


78180 


GGCGCCACCT 


TCCTGGCGGT 


CTGCCGGGGC 


78240 


ACCTGGTTGC 


CTGTTCCCTG 


GTGGGTGCTT 


78300 


GGCTGCCCGG 


TCCCCTCCTT 


GGGTCCCACG 


78360 


TCCTGCTCTG 


GGCCTGGGCA 


GGCAGGATGG 


78420 


CGTCTGGGAG 


CTGTGGACCC 


CAAGTGGGGT 


78480 


CCTGGrrGGG 


GACGGAGCCT 


CGGGGAAGGT 


78540 


GAGCCCCCAC 


GGCCCCAGGC 


AGCTCCCTGG 


78600 


TTCTCAGACA 


CGAATGGCCG 


TGGTGTGATT 


78660 


GACCCCCGGG 


TTGTCCTCAA 


GATGCAGTTC 


78720 


GGGGGCCAGG 


TGAGTTACAG 


CAGGGTGGGG 


78780 


CCGCCCCGTG 


GCCAAGGGAG 


CCCCCGTGAC 


78840 


GGCAGGAGTG 


GTACCGGCAG 


CAGGCGTCCA 


78900 


TCCGGCACCG 


CCAGGACTAC 


GGAGCTGTCT 


78960 


GTGGCAGGCG 


CGGCGCCAGG 


GGACACGCCC 


79020 


CCCACATGAG 


GCCCCGTCTC 


CTCCAGAGCC 


79080 


CCTGCAGCAG 


ATGAGGGTCT 


TCACTTCGGT 


79140 


GGCAGCAGGG 


CTATGGCCAC 


CCCCCAGGTT 


79200 


CTCCTGGGTG 


CGTCCCCACG 


TCAGGGTGTA 


79260 


GGCCCAGTTC 


TTCCGTGTTG 


CCGAGCGAAC 


79320 


GTGTTGTCCC 


CAGAGGAGCC 


AGAAATGGGT 


79380 


GCGCCGGCCC 


CCCGGGCTAC 


AGCACCCAGT 


79440 


AAGTCGCCTG 


GCCCCTTCTT 


CTCCACCAGG 


79500 


AGCCTGAAGC 


AGAGGTCCTC 


AGGTGCGGAC 


79560 


GCGGAGCGGG 


CGGCGTGGGG 


CGGGCAGCAC 


79620 


GCTTTGTGCG 


CTTCCCCTCC 


CACCTCCAAA 


79680 


CACGGGCTGC 


AACCCTCCCC 


TACAGGCAGA 


79740 


CTGAGACTCC 


CCCCAATAGC 


AGGGAGGACA 


79800 


CTCCCCCAAG 


AGGGGCTTTG 


CCACAGGCAG 


79860 


ACTGCGCACT 


CGGGTGCCCC 


TGCAGGGTCA 


79920 


TGTGTGGAGT 


ATGAGCAGGA 


GCCAGTTCCT 


79980 


GCCCTGGAGC 


ACAGCGAACA 


GCGGGCGGGG 


80040 


GGCCTTGGGA 


TGGACACAGA 


CCCTCTGTCT 


80100 


CCTCAGGCAC 


CTCCCCACAC 


ACCCCTGTAA 


80160 
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ATCCCCTGCC TGGGAGGCAG GCGGGCAAGC 
GTGGGTCCTC CACCCCACCT CACCCACAGG 
CCCTCCGCGG GTGCCCCCCA CATCACTTTG 
ACTCAAGGTC GGGTGCCCCT GCCACTGGCT 

5 TGCTGCGGGT GGAACCCAGG TTCCTTCTTC 
TTCCTCAGGC CAACCCGAAA TTTTCTCCCT 
TGCCCATCCC TGCCCTTCAG GCCCCTGGAG 
TGTCCCCTTC CTGCCAGCCC CTCGCTGTGG 
AAGTCACCAC CTGCGAGCCT CATGAGCCGC 

10 CACTGTCACC GGGGTGTGCT GTGCCCCCCT 
TTGGGACCAT CTTCCCTGTG CGTCCAGGTG 
CTGTGTGTGG TTGTGAGCTG TGTCCTCCTC 
TGTCTGAGAA GAGGCCGGCA GAAGAACCGC 
GCCACCCGGT GCGTGAGCTG TCCCTGCACC 

15 CGCAGCCGTG GGTGCCCCCA GCCACGGCTG 
GGAGCTGATG TCCAGGGCAG CTGTGATGCT 
TCTGCAGGGG GCTCATGAGT CCCAGCTGGA. 
TGGGGCCTGC AGCACTGGGC TTGGCCCTGG 
CTCATGTGAG GGACACAGCC CATTCCTAGC 

20 GCCAGGAAGC CCCCTGCCTG CTGGAAAACC 
TGGCTTCTGG GAAGAACTGG GTGGTGCCCA 
GCTGCTGAGG GGCCTGGGCT GGCGAGGGCC 
TTGGTTTCCA CGTTTCCGTG TTGGTCTGGG 
GAAGCCCCTC CTTGTGCGCT GCCATCCTGG 

25 GGGCTTGAGG GAGGAGGAGA GAGACGGGCC 
ATCTACCCTT TGCCCATTCA CTGCTCTCAG 
AGACGGGCCC AGCCCCACAG CTTTCTTCCC 
CAGTTTCTCA GGCAGCAGCC CCACCTCGGC 
CAAGGAAGGC TCTGCAGCCC CAGGGACCAG 

30 GCCTGTGTCC TGCAGGAGGA GCCCGTGGCT 
ATGGTGGCCG TGAAGCAGGA GTTGAGCCAA 
CAGGACTACA AGGGTTCCGA TGACTTCGCC 
GCTGAGGACC CCAAGAAGCA CAACCTGCTC 
CCCTGAGGGC AGTGCTGCCG CCGCGTGTGG 
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GGGCGGGGGA 


TCCCAGCTGC 


CTGGCTGTCT 


80220 


CTGCTGGCTC 


CCAGGTGGTG 


CATGCCCTGG 


80280 


GTTCTCTGGC 


GGGTCAGCTT 


GGCTCAGTGC 


80340 


GGGCTTGAGG 


CTGGCCTTTC 


TCCAGGAATG 


80400 


CTTGGGGCCT 


TTTGCCCCAG 


AAGCCCATAA 


80460 


GCTTCCTGCT 


GGGAGCCATT 


CCCCTCTTCC 


80520 


TGAGCTCCAG 


GTGCAGGCAC 


CAGGCACCTG 


80580 


TCGGACTGTC 


TTCCCTGGAC 


CTGCTCTTAC 


80640 


TGGTGTGACT 


TGGACAGGAC 


CAAGTTGTGG 


80700 


CCCCCGACCT 


CeATCTTGGC 


TCAGGGCTCC 


80760 


CTTTGGGACC 


CCAGAGTGTG 


TGGTTGGGGT 


80820 


AGGCCCACAG 


CTGCTCCACC 


CTGTCCCTCC 


80880 


GAGGAGGGAG 


GAAGAAGATC 


CGGCTGGTCA 


80940 


TGTGCCGACC 


ACCATAGACA 


CGCATGGGAA 


81000 


GTCCCGATGG 


GACCAGGGAA 


TCCACCCCCA 


81060 


GACGGCCAGG 


GGCTCAAGTG 


TGTGGTTTCT 


81120 


ATCAGGCCCC 


ACCCTTGGGC 


AGGTTTGGCA 


81180 


CATTTCCCTC 


AAGTGTGGAT 


GCACACCTGC 


81240 


CTTGGATCAA 


AGAACGGAGT 


TATAGCCGGA 


81300 


CCAAGTGTGG 


CGGCCTTTGT 


CCATGTCCCT 


81360 


GGCAGGGCTG 


GTGCCATCAG 


GAAGTGGGTG 


81420 


TGGGTGGGGA 


GTGCCTGGGC 


CGCCCCTGCC 


81480 


GTGTGTAGAG 


AGATGGGCAC 


TGCTCATCCG 


81540 


GAGCCTCAGC 


CGCATCCGCT 


GTGGGGCAGG 


81600 


ATGCAGGACC 


CCTGGCTTGA 


GGCAGAGCCA 


81660 


TTCCCTGCCA 


GCCTCTCACT 


GTGTGACCTC 


81720 


GCAGCCCCTC 


CCTATGTCCA 


TCCAGCCAGC 


81780 


AGTCACTGTC 


CCAGGGAACG 


CTCAATGTTC 


81840 


ATGATGAGGC 


TGGCCCTGAT 


GGAGCCTCGG 


81900 


GGTGCACAGA 


CGGACAGGGC 


CAAGCTCTTC 


81960 


GCCAACTTTG 


CCACCTTCAC 


CCAGGCCCTG 


82020 


GCCCTGGCCG 


CCTGTCTCGG 


CCCCCTCTTT 


82080 


CAAGGTGCCC 


TGGCTTGCAG 


AGGGCACCCA 


82140 


GGTGGGGGCC 


ATCTGGGTCC 


AAGGTGGTCT 


82200 
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CTGTTCTCTA GAGAAAAAGG GGCAGATGGG 
CCAGTTTGTG CGGCCCCACC ATAAGCAGCA 
ACGAGGCTGT GGCTATCGGC CTGAGCACAG 
CCTGGACCCC ACTGGTAAAT GGGGCCCCAG 

5 AGTGTGGGCC AGAGTCCTGG GCTGCTTGGG 
CCTCTCTGTT CCCCTATGGG AGTGATGGGG 
ACCACCTCCA CCTTCACCAC CACCACCTCC 
ACCACCTCGA CCACCTCCAC CACCTCCACC 
ACCACCTCCA CCACCACCAC CACCACCACC 

10 TCCACCTCCA CCACCACCAC CACCTCCACC 
ACCACCTCCA CCACCACCAC CACCACCTCC 
GGAGACCCTG TGCAACTCCA TGCACAGCCC 
CAGCCCTGTC CAACTGCCAC ACGTCCCCTG 
GCACACGCCA GGGTCCTAGG GTCCTAGACC 

15 GGTGGAGTCC AAGTCTCCAG AGGCGGAAGC 
TACAGAGTTC CCCTAGTTCA CCCAGGGGGG 
CTTCTGGGCT TGGTACTCAC TGGGATATCC 
CTCCTGAGTG GGGCTTTGGC CTGCCCGCCA 
TCTCCTCTGA TGCCCCCAGC ACCCAGGCGT 

20 GAGGTGGGTG AGGCCTGGCC TCCCTAGCCA, 
AGATGCTCCC GACCAGCTTT GTGGCTCTAC 
CeCAAGCTGA CCGTGTCCAC GGCTGCAGCC 
CAGGGCAGGC CCCACCTGTC GCCCAGGCCA 
GTGCAGCCTA. CGACTTGGTG GGTCCCTCAG 

25 CCGGGGCTGC CCCTGTGGGG AGCCATCTCA 
GCAGTTGTCC TGAGCAGCTC TCCAGGAGTT 
CTCTCAGTCC TCCACCCCAG CGCCACTCTG 
GGCAGCCAAC CACAGTGGGG GTCTGGAGTG 
GTGAGCGCCT ACCTGGCTGA TGCCCGCAGG 
'30 TTGGCAGCGC TGACAGCCTA TAAGCAAGAC 
GCCGCCCTGA CCACTGCAAA GCCAGAGGAC 
CGTGGGGAAC AGCCGGTGGG GTGGGGGGCA 
ATTGAAGCTC CCCGCAGGGT TCAGCATGTT 
ACAGACGTGC ACAGACCTGA CCGGCCGGCC 
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GACAGACGCC 


CCTTCCTCTA 


CAGGCTTCTA 


82260 


GTTTGAGGAG 


GTCTGTATCC 


AGCTGACAGG 


82320 


CATTCCCCGA 


AGGCAGCGGG 


CACAGCCGGT 


82380 


GTGGGACCCT 


CAGACTCCTG 


CGTGGAAGGC 


82440 


GTGGGCATCC 


TCGGGCCCTG 


CTTGGCCCCG 


8250O 


GCCTCCACCT 


CCACCACCAC 


CACCAGCAGC 


82560 


ACCACCACCA 


CCTCCACCAC 


CTCCACCTCC 


82620 


ACCACCACCA 


CCTCCACCAC 


CACCACCACC 


82680 


ACCTCCACCT 


CCACCACCTC 


CACCACCACC 


82740 


TCCACCACCT 


CCACCTCCAC 


CTCCACCACC 


82800. 


ACCTCCACCA 


GCAGCAGCAT 


CACTTGTTGG 


82860 


TGTCCCTGCC 


ATAGCCCCGA 


CCCCTAAGCA 


82920 


CCTCCCATGC 


ATGGTCCTGG 


GGGGTCAACT 


82980 


CCTGTCCTCC 


CTGTTTCTGC 


CTCTGTTTGG 


83040 


ATCTGTGTTC 


GTGTGTTAAT 


GAACAGCCCC 


83100 


AACCTAGCCT 


GTTGGGACGA 


CCCCAGATCC 


83160 


TCATGCCTGC 


ACCCAGCCTA 


CGGCTCTGAG 


83220 


CTGTTCCAGC 


CCCCATCCAG 


CAGGCTGGTG 


83280 


GTACCTGCCT 


GGGTTTTCCC 


GCCCTGGTCT 


83340 


GCCCTGCCCC 


CCCACCCCAG 


GGAACTTTCC 


83400 


ATCTCTTCAT 


CAGGAAGAAC 


GGCGCCGGAT 


83460 


CAGCAGCTGG 


ACCCCCAAGA 


GCACCTGAAC 


83520 


CCCCCAACAG 


GTAGCTGACT 


CCTGAACCGT 


83580 


TGGCTTCACG 


AGGCTAACTC 


TTGAGTGTGG 


83640 


TGGTGGGGAC 


TGCTCCCGGT 


TCTGCACCCC 


83700 


CCTGGAGGAA 


GGGCGGGCAG 


GGCGGTGGGA 


83760 


AGCCATGCTA 


CTCCCACACC 


AGGAGACCCT 


83820 


CCCAGAGCAG 


GGAAGCAGGG 


CCAGCACGCC 


83880 


GCCCTGGGGT 


CCGCGGGCTG 


TAGCCAACTC 


83940 


GACGACCTCG 


ACAAGGTGCT 


GGCTGTGTTG 


84000 


TTCCCCCTGC 


TGCACAGCAA 


GTGGCCCTGG 


84060 


GGGGACAAAA 


TGGGGGCTGT 


GCCGGGTCTG 


84120 


TGTGCGTCCA 


CACCACAAGC 


AGCGCTTCTC 


84180 


CTACCCGGGC 


ATGGAGCCAC 


CGGGACCCCA 


84240 



-62- 



wo 01/42434 



PCT/USOO/33065 



GGAGGAGAGG CTTGCCGTGC 
ACCTGCCTGG CTGCTCCTGG 
AGCCCCAGGC CCCCCTCAGA 
CCGGGAAGAC CCAGAGCAAG 
5 GGGCGGGCGG TGAGGATGCA 
CATCTGAGTG GGGTGAGCCT 
CGGGCGGGCC CCTCTCAGCA 
CCTTCCAGTG CCCTGCCTGT 
AGGTTGGTGC' CTGGCCACTA 

10 GGAAAGGGCA GGCCCTTGTC 
GTCTCACAGG CCTCTAGGAT 
ATGCAGGTCT TCTGGCCAGA 
GTGGCTTGAT CACCTGCCTG 
CCAGCCCATG CCAGCCGGCT 

15 GTCCCTGCGG TGGGACCGGA 
CTGGGGTGGG TTTCTGGGAA 
GGTGCCACAG GGGCACCCCA 
CGTGTCCACT TTTAAtCAGG 

cggtgtcttc gtggcctggg 
20 . ccctcccagc ctgccatgtg 
tcctcacagc cgcagccaag 
gggctggtct ctgggttggg 
gaggcccctc ccagaaccac 
gaggggggct gcctggggag 
25 tctccccgcc catatatgta 
ccgcaggcgg accgggggca 

TCCGCGCTGA GCCGCGCTCT 

ggcctgtcgc tgctgtgcct 
cgcggagtgg cagaaacacc 
30 gtgtgtgccc agtgcccccc 
acgacgtgtg gcccgtgtcc 

TGCCGCTACT GCAACGTCCT 
ACCCACAACC GCGCCTGCCG 
GAGCACGCAT CGTGTCCACC 



CTCCTGTGCT TACCCACAGG 
CAGCGCCCCA ACCGCACGCA 
CTCAAGTCTC TGTCTCCAGG 
ATCTCGTCCT TCCTTAGACA 
GGTCCCAGCC AGTCCTCAGG 
CATGGGAGAG ACATCGCTGG 
GGCTGTGTGT GCCAGGGCTG 
GAGTTCCAGC GCTGCCAAGC 
CAGTTCCTGC TGGGTGTAGC 
CTGGTGGCAA CGCCTGGCAG 
GTGCCCAGCC TGCCACACCG 
GCCCCAGTGA GTGCCCACGG 
TCCAGCTCTG GTGGGCCAAG 
TGGCCCGCTG CAGGCCTCAG 
TCTGGGCCTG CCTCTGAGAA 
AGTGCTTCCC CAGAACTTCC 
GCTGAGCCCC TCACCGGGAA 
GGACAGGGCT CTCTAATAAA 
CTTGGTGGTG GGAGTTGAGG 
ACACTGTACT TCCTCCACGG 
CTGCAGTTGG TAGGGGATCC 
AGCTGCCCCA GGTGCTGAGG 
CCTTGGACTG AGCTGTGGGG 
GGAGGGGTTC CTATGGGGCG 
GGGCAGCAGC AGGATGGGCT 
AAGGAGGTGG CATGTCGGTC 
CCCTGCTCCA GCAAGGACCA 
GGTGTTGGCG CTGCCTGCCC 
GACCTACCCC TGGCGGGACG 
AGGCACCTTT GTGCAGCGGC 
ACCGCGCCAC TACACGCAGT 
CTGCGGGGAG CGTGAGGAGG 
CTGCCGCACC GGCTTCTTCG 
TGGTGCCGGC GTGATTGCCC 



GCTCCCCAAC 


CAGGTAGGGC 


84300 


GCCCTGGGAG 


TGAGCAGCAA 


84360 


CCCCTCACGG 


TCCGAGAAGA 


84420 


GAGGCCAGCA 


GGGACTGTGG 


84480 


ACCTCCCCAC 


GGGCCTGCAG 


84540 


GCAGCAGGCC 


ACGGGAGCTC 


84600 


TGGGGCAGAG 


GACGTGGTGC 


84660 


CTGGTGGCAA 


CGGCACCTTC 


84720 


CCCAGGTGAT 


GGGCTGAGGG 


84780 


ACGTGTGCAG 


TGGGCCGGTT 


84840 


CCTCCAGGAA 


GCAGAGCGTC 


84900 


AGGCCCCCAG 


CACACCCAAC 


84960 


AACCCACCCA 


ACAGAATAGG 


85020 


GCAGGCGGGG 


CCCATGGTTG 


85080 


GCCCTGAGCT 


ACCTTGGGGT 


85140 


CTGGCTCCTG 


GCCTGTGAGT 


85200 


GGAGGAGACC 


CCCGTGGGCA 


85260 


GCTGCTGGCA 


GTGCCCAGGA 


85320 


GAGAGGGAGT 


TGGCAGAGGC 


85380 


TGGGCTCAGC 


CCTGCCCTCA 


85440 


ACCGACACAC 


CAGGCTGCCT 


85500 


AGGGCAGCTC 


CCTGGCTGGT 


85560 


AGGGATGGTA 


CCAGGTGGGT 


85620 


TGGCGAGGCT 


GGCCCAGCCC 


85680 


TCTGGACTTG 


GGCGGCCCCT 


85740 


AGGCACAGCA 


GGGTCCTGTG 


85800 


TGAGGGCGCT 


GGAGGGGCCA 


85860 


TGCTGCCGGT 


GCCGGCTGTA 


85920 


CAGAGACAGG 


GGAGCGGCTG 


85980 


CGTGCCGCCG 


AGACAGCCCC 


86040 


TCTGGAACTA 


CCTGGAGCGC 


86100 


AGGCACGGGC 


TTGCCACGCC 


86160 


CGCACGCTGG 


TTTCTGCTTG 


86220 


CGGGTGAGAG 


CTGGGCGAGG 


86280 
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GGAGGGGCCC CCAGGAGTGG 
CTTGCACCCT GAGCTAGGAC 
CACCCCCAGC CAGAACACGC 
TTCCAGCTCA GAGCAGTGCC 
5 TGTGCCAGGC TCTTCCTCCC 
CACCAGGGTA CCAGGTGAGC 
TTGTGCCCTC ACTCCTGCCC 
GGAAGGGGCC ACAGTGGATT 
TGCeCTCTCA GGGGTGGCTG 

10 GCTCTGGGAA AGTGGGCAGC 
GACACGGAGA AACCGAGGCC 
CCTCCTCTGA CATGGGGAAA 
GAAGTGACCC CCTTAACATC 
TCCGATGGCC CAGGAGCCCC 

15 CCGAGTGGCC ACTCCTGCCC 
CCCTTCTCCA GTCCCCATCG 
TCTCTCTCCT GCAAACCCCC 
CAGTGTGTGT GGGTGAAATG 
AGTGTGAGCG TGCCGTCATC 

20 AGCGGCTGCT GCAGGCCCTC 
GCGCGGCCTT GCAGCTGAAG 
GGGCGCTGCT GGTGCGGCTG 
GGAGCGTCCG TGAGCGCTTC 
CATCCTTGGC ACCCCACTTG 

25 TCTTAAAGCT TATTTTTATA 
GGAGGGCAGC CGGGGACACC 
ATCAGAACTG CTGCCCGGGG 
GGGGAGTGCC ACCTCTGCCC 
CGAGGGTCAG GGCCTGGGAG 

30 CCTAGTCGCC GGGGCCCACA 
AGGGGATTTC CCGAGAGGGG 
GAAGTGAAGC TTCTCCCCCT 
CCAGGGGGCT GCACCAGCCA 
CCCCCCCAGG TCCTGGGAAC 



TGGCCGGAGG TGTGGCAGGG 
ACCAGTTCCC CTGACCCTGT 
AGTGCCAGGC GTGCCCCCCA 
AGCCCCACCG CAACTGCACG 
ATGACACGCT GTGCACCAGC 
CAGAGGCCTG AGGGGGCAGC 
CTGCACGTGC ATCTAGCCTG 
TGAGGGGTCA GGGGTCCCTC 
AGAATTTGGA TCTGAGCCAG 
AATCTCCTAA CTGCCCGAGG 
TGATGGTAAC TCTCCTAACT 
CCGAGGCCCA ATGTTAACCA 
AAGTCAGGTC CGGTCCATCT 
AAGCCCTTGC CTGGGCCCCC 
CCTAGGCCTT TGCTCCAGCT 
TTGCACTGCC CTCTCCAGCA 
CGAGTGGGGC CCAGAAAGCA 
ATCGGACCGC TGCCTCCCCA 
GACTTTGTGG CTTTCCAGGA 
GAGGCCCCGG AGGGCTGGGG 
CTGCGTCGGC GGCTCACGGA 
CTGCAGGCGC TGCGCGTGGC 
CTCCCTGTGC ACTGATCCTG 
CACTGAAAGA GGCTTTTTTT 
AAGCTTTTTC ATAAAACTGG 
TGAGCCGCCC GCTGTGCCCA 
CTTCCCCTAC CTCAGACAGA 
CCAGTGGCTG TGGCACGTGG 
GGTATCATTG CTGGAAGAAC 
CTAACCCCCC ACTTATGAAT 
ACCTGCCAAA GACCTCCTCC 
CTGGGGCAGG CTCTGAAGCC 
CTCGCCTCCC CAGCACGGCC 
CAACCTGCAG AACACACACA 



GTCAGGTTGC 


TGGTCCCAGC 


86340 


TCTTCCCTCC 


TGGCTGCAGG 


86400 


GGCACCTTCT 


CAGCCAGCAG 


86460 


GCCCTGGGCC 


TGGCCCTCAA 


86520 


TGCACTGGCT 


TCCCCCTCAG 


86580 


ACACTGCAGG 


CCAGGCCCAC 


86640 


AGGCATGCCA 


GCTGGCTCTG 


86700 


CACTAGATCC 


CCACCAAGTC 


86760 


GGCACAGCCT 


CCCCTGGGGA 


86820 


GGAAGGTGGC 


TGGCTCCTCT 


86880 


GCCTGAGAGG 


AAGGTGGCTG 


86940 


CTGTTGAGAA 


GTCACAGGGG 


87000 


GCAGGTCCCA 


ACTCGCCCCT 


87060 


TTGCCTCTTG 


CAGCCAAGGT 


87120 


CTCTGACCGA 


AGGCTCCTGC 


87180 


CGGCTCACTG 


CACAGGGATT 


87240 


GGGTACCTGG 


CAGCCCCCGC 


87300 


CCCCACTGCA 


GGAGCTGAGG 


87360 


CATCTCCATC 


AAGAGGCTGC 


87420 


TCCGACACCA 


AGGGCGGGCC 


87480 


GCTCCTGGGG 


GCGCAGGACG 


87540 


CAGGATGCCC 


GGGCTGGAGC 


87600 


GCCCCGTCTT 


ATTTATTCTA 


87660 


TAAATAGAAG 


AAATGAGGTT 


87720 


TTGTAGTTGC 


ACAGCTACTG 


87780 


GATCCCTCAG 


GCTGCCTGCC 


87840 


CCCTCCCTGG 


GAGGATCAGT 


87900 


CAGGGGCCCC 


TGAAGCTCAG 


87960 


AGGATGGGGC 


TCAGGCCAGC 


88020 


TCCTCCCACT 


CCCAACTCAC 


88080 


AGGCCTCCCA 


TGCTTCCCGG 


88140 


TCCCGATGCA 


CCCAGAGCAA 


88200 


AGGTTCCCGG 


GGCTGGAGGT 


88260 


GGGTCCCCTG 


GAGAGGACGC 


88320 
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GGGGACTTCC AGGGCCCGAC TCCTGTGAGT 
CCCTGACTCA TGCCCCTTCC CAGCAGCTCC 
CAGGAAGGGA GTGCCTGGAC TCTCCAGGCC 
CCCCTGAGCA GCCCAGTAAC ACCATCACCG 
5 GGAGGGAAGG GATGGACAGA GCCACACTCG 
CACTGGGCAG GCCGCTCCTG CCCACAGCCA 
CCAGTGGGGT CCGTGCAGGG AGGACCCCAG 
GGGCCCCACA GAGAACCCCT CCGGGAGGTT 
CCACAAACAC TCCCCAACTT GCGGGGCTGG 

10 CCTCAGCTTT TTGCATCAGT CAGCTCCCTC 
CCAGGCGTGG GGTATAGGTC TTCCCCCGCA 
AATCCAGGGC CCCCGCTGGC CTTTGGGAAC 
GCCCCAGACC ACTTCCGGGG TCACGGGGTC 
TGACGGGCCT CCTCCTCGAT GGGGCGGAGA 

15 GGCTGTCCTC ATGCAGCCCA AGCCAGCCTG 
TCCCTCAGAC CCCCCAGAAA GGGCCTCGAA 
CCGCTGCAGG GCCTGGGCCA GCCGGGCTGC 
ACGCTTTTCC AGACATAGAG GAAAGTTTGT 
CCAAAGCAAA GCAAACCCAC CCCCCAGATC 

20 CTCCAGCACC AGGGGACCAG CCGTCCCGAC 
TCTGCCGCGG CGGCCGGTGC ACATTCCGCA 
CGCGCACCCC TTTGCTGTGA AGACAGCGGG 
CCCCGCAGGT GCAGCCCCCA CTCACCCTGT 
GCGCCTGCCG ATCTTGCTGG TGCAGTCGCT 

25 GCACGTCTGG GGGAGGTAAG GCCGTGAGGA 
GTGGGCCCGG GACTCTCAGA AGGGCGTATG 
CACCAGGGTC CCAGTGTCTC CCACAGAGAC 
AGGCTGAACA TGGCTGGTAG TGCCTGAGAC 
GCAGCCCCCA TCCCCTCCCT GGGGGACACC 

30 AGGCCTCCTG GGCAGTGGCG GAGGCAGGTC 
GAGGTGGGAG CCCTGCATCA GTGATGGGGG 
AACCACCTGC CCAGTCTTCA AAAAGCAGCC 
CCCTGCCCCG TACCAGATGT CCCTCTTGTC 
AGGCTGGGGG CAACAGAGCC ACCCCAGAGA 
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CACAGCCCCG 


CAGCTGCTGC 


GCCACCCCCA 


88330 


TCCCAGGACC 


CCATGTCCTT 


CCCACATCCG 


88440 


CACCTGGGGA 


GCCCCTCACC 


TGCCCACCAG 


88500 


TGTCCAACAG 


CCAGGAGCCT 


CCACCCTCCA 


88560 


CCGTCTTTAT 


TTTGCACTCA 


CCCTGGGTGA 


88620 


GACTGAGGAA 


GAACACAGCA 


CTCGGCAGGC 


88680 


GACCAGCCTT 


ACTCCCGAGC 


AGGGGACACA 


88740 


CTCTCCTGGC 


TGGGGGAGGG 


CTCTGGACCC 


88800 


GGCATAAAAA 


CAGCCACTCC 


CAGCAGGCCC 


88860 


CCGGGGGATT 


AGGGTGAGGT 


GAAGCCAGGC 


88920 


GGCCTCAGCC 


CTGTCCCGAG 


GCTGCATCAC 


88980 


ATGGCCTGGG 


TCTTCCTCAA 


GGCAAGATCA 


89040 


ACAGGGCAGA 


AGCCAGATGG 


CAGCCATGGC 


89100 


CAGCCACGGG 


GTCTCCCGAG 


GGTCCCACAG 


89160 


AGCACTGGAG 


CCCCAATTCC 


CAACCAGGTC 


89220 


AGGCCGCCGC 


TGCGCCCTGT 


GGAAAGGCTG 


89280 


CAGACTCCCC 


TCCAAAGCCT 


CCGGATGCCT 


89340 


CTTCGAGAAA 


ACAAAGTAAA 


TAGAAGAACC 


89400 


AGCAGCATGG 


GAGCCAACAG 


GAGGCCACTC 


89460 


GGCAGCGCGG 


CTGCGCCTAC 


GTGATGTCCC 


89520 


CGACACACTT 


CACCATCCAC 


TCGATGCCCT 


89580 


TGTGAGGCGG 


GGGGTCTCGG 


TCCCCAAAGC 


89640 


GAGGGCCGAG 


CAGGCCTGGG 


TCAGGCAATC 


8970O 


GAAGGCCGTC 


TTGATGTCAG 


GGATTGAGAG 


89760 


GCAGCCCCCA 


CGTCTGGCCC 


TGTCCTGCCT 


89820 


CCCTTCACCC 


CAGGGAAACA 


GCCAGAGCTC 


89880 


CACAGCAGTG 


AGGACCCTGT 


GCTCAGCCCG 


89940 


AAACTAGACG 


TCCACACGGC 


TCCAAGGAGT 


90000 


CTGAGCCCCG 


AGGTGGGGCG 


CTGAGGACTG 


90060 


CCAGGGGCCC 


ACACAGCCGG 


GGATGATGGA 


90120 


CAGTCTGCAG 


TCATGGTGGC 


TTCTGCTCAC 


90180 


CTCCCCTCCC 


CTTTTCCTCC 


GAGGGGAGAC 


90240 


GGCTGAGATT 


GTAGGGGAGG 


CCAGCCTTAC 


9O3O0 


AGGCAGGAAG 


TGAAGATTCA 


CCCGGCCCTC 


90360 
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TGGACGCCGG GCTGCTTCTG TGCAAAGCCA 
CCAGTGCTCC CGGGGGCAGT GGCACCTCAG 
GGCTGGCACA GATGCCACCT CCAAGGGTAG 
CACCAGGGCC CCAGCATCAC TTCTTTGAGC 
5 TGGCCACACA CTCCTGAGAC ACGGCCGCCC 
AGAGGCCTCT GGGCGCCTGC CCTGAGGTGG 
TGGAGAGGAA CCAAGCTTGG GAAGCTGCTG 
TACCCCCTGG GCAGTCACTG GGCCTGGGGT 
GGGGGAGGCT GGGGACTCGG GAAGCTGCTG 

10 TCCTGCTTGT TGGCCAGCAC CAAGACGGGG 
TTCTCTGGGG AGGGCAGGAG AGGCAGCGCC 
AGGGGCCCAC AGGCGTGGAC ACTGTGACAG 
CTAGGCAGGA GCACTTCTGA CCAGACACTG 
TGGACCTGCC CCCACTCACC ATCCATCCCT 

15 GCCTGCTTGG ACTCAGCCAG CCTCTCCTCG 
TGACACTCCG CATAATACTG GGAGGAAGCA 
CAAGGGCCAG CAGAGCCAGG CCTGTGTCAT 
GGCAAGAGGG CAGCCCCCAA CTGCAAGACC 
GATCTGGTGG AGGTGTCCCC AGACGCTGCT 

20 ATGGTGGCCA AAGGTGAAAG ACAGGGATTG 
CCAACACTTT GGGAGGCAGA AGCAGGAGGA 
GCAACACAGT GAGACTCCGT CTGTACAAAA 
CTACTCGGGA GGCTGAGGTG GGAGGATGGC 
CTCTGATCAC ACCACTGCAC TCCAGCCTGG 

25 AGATAGGGTT TGGGGGCTGG AGGAACCTAG 
CTGTGGGGCT CTGTGCCAGC ACCTCGCACA 
GACCAGCACC ATCTACGAAG CGAGCCTTCC 
CAGACCTCTG CAGCCTGGGC TGGTCAGTCC 
CAGGGCACAG CCCCTGCAGC CTCAGGTACA 

30 CTCTATCTCA ATCAGAAAAA AACACAGCAA 
TCACATTTGC CACTTTGCCT CCAGGCTGTG 
GAGACCTAGA ACTGGATGCT GACCAGGTAG 
ACACACCCGA GAGCATGAGA AGCCAGGAGG 
GTCTGAGCAG GGAGTACCAG ACAACCCCAT 
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CTCCAAGAGA 


ACAGCTAGAA 


CTCAGCGTGG 


90420 


AGGGGTCTTG 


AGGGGCTGCC 


CTGGGGGTGG 


90480 


CAGGAACAGG 


TAAGGGTCAG 


AGGTGACTCC 


9054O 


TCTGAGTTTC 


ACCTGGGTGT 


CCCCACAGCT 


90600 


TCCTGGGGAG 


AGGTGCCCTG 


CATAGCAGGA 


90660 


GAGAACCTCC 


AGGGCTGGCA 


GCAGCAGGTC 


90720 


GGGGCAGGGC 


AGGCCTTGAG 


AATGGCTCTG 


90780 


GTCTGGGTGC 


ACACCTACTC 


CCCTTGCTGT 


90840 


CGGGAGGCAG 


GGGTGGGGCT 


CACCTCCACA 


90900 


ACACCGCACA 


GCGCCTCGCT 


GGTCACCACC 


90960 


TCACACCCAG 


CATCCTGCCT 


CTGACTGCCC 


91020 


CCACTCCCTC 


TGCCCCCCCC 


CCGTCACCCA 


91080 


AGCCTGCCCC 


AGGCACAGAG 


CTGCCCAAGC 


91140 


CCCAGAGCAG 


CCAGGCCGCA 


CTCACCAAAC. 


91200 


TCGGTGGAGT 


CAATGACGTA 


GATGACGCCG 


91260 


CCAGGAGTTG 


GGGCTCAGTC 


CCCACCCTGC 


91320 


GGCCACAGTG 


AGGGGCTCAC 


ATGAGGAAGG 


91380 


CTTCTGGGAT 


GCATTCTGGG 


GTTGCGGGGA 


91440 


CCTGAGAACC 


TGCCGGCAAC 


CTTTGGGCTG 


91500 


GGCCAGGCGT 


GGTGGCTCAC 


ACTTATTATC 


91560 


TCACCTGAGC 


CCACTTCACG 


GCCAACCTGG 


91620 


GCTTATGGTA 


ATGTGCGCCT 


GCAGTCCTAG 


91680 


TTGAGCCTGG 


GAGGTTGAGG 


CTGTAGTGAG 


91740 


GTGAGAATGA 


GAGACCCTGT 


CTCAAAAAAA 


91800 


ACCACAGCCT 


GGCCCGTTGA 


GGGAGTGCAC 


91860 


GGGAGGGAGT 


GTGGCCATGC 


GGATAAGACT 


91920 


CTGCCAGGAC 


AGGGCCAGAG 


TCACTGAGCT 


91980 


TGGGCTCGCT 


GGCAACACTC 


CTGGGCAAGA 


92040 


AGTGCTGAGC 


CCTGGACCAG 


ATGAGTGCAC 


92100 


ACTCCGCGTC 


CACGTGGAGC 


AGACAACAGC 


92160 


CCAGCTCTCC 


TGTCCAGGCA 


TGAGTGCCCA 


92220 


GACAAGCTGG 


TGGTCAGTGT 


GTTAAGACAC 


92280 


CACAGCCCAA 


CTCTCCGAAA 


TCCTTAGGGT 


92340 


CCCAGTGCCA 


GACAAGCTTG 


TGCACCTGCA 


92400 
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CTTCCCACAG AGGAGAGAAG 
CAAGGCCAGG CCCCCCCACC 
GCGCCCTCCA GGGTTCTGCA 
ACAGACAGGT CCCACACAGA 
5 TCTGCTGCTG ACACAAGGGA 
GCGGCTCTGT GGAGGCTGCA 
TCTGAGCAGG GTGGCAGAAG 
CCAGAGCCAC CCATCACACC 
ACGGGAGCGA CACCTCAGGG 

10 GGCGTCACAG CAGTGGTCAT 
CAGCGTGGCT CCTGGCTGGC 
GAGTTTTTCT CCCCAGCAGC 
AGACTAGTGT TACAGCAAAG 
ACACCAACCC AGGGTCCAGC 

15 GTGCTGGCTC TGCACAAGGA 
TCCGGTCCCT CCCAGTGACC 
AGCAGGGTCA GGGGTCCCGG 
GAGCCACAGC TGCCTCATGG 
GGAGATGGGG CTGCCATCCC 

20 CCACCACGGT GGTGTCTGGA 
TGGTAGTGAA TTGGGAGGGC 
GATGGGCTAC AGTGGCCCCC 
CAGGACCTTG GAGGAACCCC 
GTGGCCAGTG CAGGCATCTC 

25 ACCAGCAGAT GGTCTCATTT 
TGCCAGGTCA CACCTGCCCA 
GTCCCGGCCA GCCTGGCCCC 
CTGGGCCAAC CACAGCTCTG 
CCCTGCGTTC CTGGTGCCGA 

30 TGTGCCTGAG ACACCGGCCT 
CTGCGCACTG GACACACAGG 
AGGATCTATC AGAGATCACT 
AGGAGGGGAC ACTTTGGGGG 
ACCTGGGCAC TCAGCTCTAG 



CCTGTGCACC TGCACTTCCC 
CCCAGGAACT TCCCACAGTG 
GGTAGCGAGG CCCCCCCACC 
GGCCGCCAAC CCCTCAAGGG 
GCAGGCGGAC CCTAAGGTGG 
GCAAGCCCAG TGAGAGAATC 
GTTCATGTGC AACCGGGTCC 
TACCAGGCAC AAGGTGCACA 
CTGAGTGCGG GCAGGACCCA 
TGTGGGGTGC CCCACAAGGA 
CAGCTGACCC CAGTGGAGCA 
AATGGGAGCT CCCCAACTGC 
AACCCAGGAG CTGAGGTCCT 
CCCAGGACGA GGCCAATTCG 
TGCAGGATAC AGGAACCAGG 
TAGGGGGGTC CCTGCAGCTG 
CCACTAGAGC AGCACATACT 
GCATGACTTG CACCTCATGT 
ACAGCTGTCC CACAGCTGGG 
GAAGGGCTCA GACTGCCAGG 
ACTCAAGGAA GGGCTGTGCA 
TTAAGTCTCC CTCTTCCAGT 
TCTCCAACGT GGAAGTGTGG 
TGGCCCCACT GTATTCTTGC 
TGGTTTCTGT GGGACCCACT 
GGGCAGCCCA CACTGGGGAC 
ACATGGCACA GCATAGCCAG 
CGGATCCTGC TGCCTGAGTC 
CCCCCAGTGT GCATCCTGTA 
GTGACAAGAT CATCATCATC 
CCCTGACTCA ACTTGTCCTG 
TGGCCATGGG AGAAATGTCT 
CGCCTAGGAA AGGGGAACTA 
AGATGGCATC AGGGCAGCCA 



ACAGTGGAAA 


GGAGGAGGCC 


92460 


GAGAGGAGGC 


CCAAGGCCAG 


92520 


CCCAGGAACT 


TCTCTGGCCT 


92580 


ACCCTGCAGT 


GTGCCGGCTG 


92640 


AGACCTCTGT 


GGCAGGAGGG 


92700 


TCCACGTGGC 


TCCTGGGGCT 


92760 


TGGACCATGG 


GACCACGTGG 


92820 


GCCCAGCAGG 


GCCGCAGTGG 


92880 


GAGCCCCACG 


CCCCAGTGGA 


92940 


GGGGGAAGAG 


GGAGGTGTCC 


93000 


GTCAGAGGGA 


CTGTGGGTCT 


93060 


AAAGTGCCAG 


CCAGCCTGAG 


93120 


GGCACATGCC 


ACACATGTGG 


93180 


CAATGACGCC 


CCTTTCTGTG 


93240 


GTGGGAGCAG 


GGGCCTCCCT 


93300 


ATCCTCCCAG 


CTCTGAGCTC 


93360 


CAGCAGACAC 


GCTGAATGAC 


93420 


CTAGGAGACC 


CTGGTGGGCA 


93480 


GACCCAGGGA 


GCCACTGGCC 


93540 


AAGTCGCACC 


CCAGCAGAAG 


93600 


GCCCCAAGAC 


CAGCAGCAAG 


93660 


TTCGCCTTAA 


GAGAGGCCCT 


93720 


GTCCACATAG 


GGCTGCAGCT 


93780 


TTCATGTTGG 


AGAACACTGC 


93840 


TTGGCTGCAA 


AGAGCCACAC 


93900 


CCACCAQGCC 




93960 


TTCTCCTCCA 


GGGCTCCCTG 


94020 


GACCTCTCCT 


CTCCCGTCCT 


94080 


CCTCGACCTG 


TCTCAGCATC 


94140 


TGTGTCACTC 


CCCAAGCATG 


94200 


TCTGACTTCA 


GTGGTCCTAC 


94260 


TCTTGGCTAG 


AAGTCACAGC 


94320 


GGATCAAAAA 


AGAGATCAGG 


94380 


AGGCACTGGG 


GACACCCCAC 


94440 
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ACCCACTGTG 


CCAGCCTAGG 


GCAGGGAGCC 


CGAGGAAGCC 


ACAGGCTCTG 


CCCTGCTCAG 


94500 




TGCTGGACTC 


AGTGCCTGGC 


CCAGGCTGAG 


AAGGAGATAA 


ACTGCAGCCT 


TGGGGGTGTG 


94560 




GGGAAGGGGC 


ACCACACTGG 


GATCTCAGAA 


ATGCCCAAAA 


CCTGTGTCAA AATAGGAGAC 


94620 




TGCCGCTGTG 


AGACCCTGAG 


GAGTCTTCTG 


GTGATCATGG 


AAGAACAAAT 


GTTAAGCTAG 


94680 


5 


AACTGAAGGA 


ACCTCATCAG 


GGGAGAGGCA 


GCCATCCTGC 


CGTCCCCACA 


TCTGGTCTTT 


94740 




GCCATTTCTG 


TGTCCTGTGG 


TGGTCAGCAG 


CAAGGTCTCT 


GAGCCGAAAG 


GAGGCACTCA 


94800 




CTTTGGAGGA 


GTGCAGGGTC 


CCCAGGTCCC 


CACACTTTGT 


CTTGTCCTGA 


CTGAGAAAGA 


94860 




AACAGACTGC 


CCTGACCTCT 


CTGACTTGGC 


CAGCGAGGTT 


GCCCTTAGGC 


TCAAACCCAA 


94920 




GCCAGGGTTT 


GAACATTCCC 


AGACACTTGT 


AAGATGTTTA 


GGTTGTTAAC 


ATAATGTTCA 


94980 


10 


GGTTTCAAAA 


CATTGAAAGA 


AACTAGCCCC 


AGCCCTGAAC 


CCAGATCCCC 


CCCGGCTTCA 


95040 




GGCATGACCA 


GTGAACACGC 


CCTTCTCTCA CTGGTCACCT 


GAGGATGCCG 


CACTCTGTCA 


95100 




ACAGGTTCCC 


CTAATACATG 


CTCTGATCTG 


ATCGCCTTGG 


CATTTAGTGA 


TTCTTTCCCT 


95160 




GGAATTCTCC 


ACTGGCCCCA TCGCAGGGAA CTCCCAAGTG GGAAACTCCC CTACCACCAC 


95220 




TTTTGGGGCA ACTTCAGCTA AGGGTTCAGC TGGGACAAAA 


CAGGGAGCCA 


CTCGGGAACC 


95280 


15 


TGGGACAGGA 


CCAGAGAGAA 


AACCCGAGGG 


ACAGAGTGGG 


TAAGGAAAGC 


TGCTGAGGAA 


95340 




GGGCCCAAAG 


GGCACTCTGG 


AAAGAAGTGG 


CACTGGAGGG 


CTGGGGTGGG 


GGTGGTCCTG 


95400 




GCCAGGGAGT 


CTTACCTTGT 


CCCACAAAGA 


CTGCAGCTCT 


TCCTGCCCTC 


CTAAGTCCCA 


95460 




GAACATGAGC 


CGAGCCTTTC 


CCACATCCAC 


AGTGCCGACT 


GGGGAGAGGA 


GGAAACAGGC 


95520 




AAGGCTCATG 


ACCTTGGTCC 


TCGACACACC 


CAGTCCCAGC 


TCTCCCAGGG 


GATGGGGCAA 


95580 


20 


ACCATGCTGG 


TGCCACTCAA 


ATGAGACTTG 


AGAGGGGCCC 


GACAGGGCTG 


TGGCCACGGG 


95640 




CCAGCTGGAC 


TGTGAATATC 


ACGGCATCCT 


CAAGGCCCCA 


AACCCACAGC 


CTGCTATTGA 


95700 




GACCCTTACT 


GTTTAGGCCC 


ACGGTGGTGG 


TGATTTTGGA 


TAGACTCATC 


CCCTTGTAGT 


95760 




TCTTGTTAAA 


TCGGGTTTTC 


GACTGCTCCA 


GGAAGGTCTG 


AGGAGAGAGG 


CAGAGGCGAA 


95820 




ACACATCAAG 


GAGGGGCTAT 


ACTGGCTTCC 


AAATATCCTT 


ACTCAGGTCT 


GTTCTTTAAA 


95880 


25 


AGACAGAAAC 


AGAAACAGAG 


CAACACTCTG 


CTCTTCAGGA 


GGCTGGTGGT 


GACTATCCTG 


95940 




CCGTCTCAGG 


TGAAATTTGG 


CTTCCGTCTG 


GGTAGTGAAC 


GTGCAGCTGA 


CAGCACAAAA 


96000 




CCGAAGGGGG 


CGCCGCCAGG 


CCGTGGGAAA 


GGTGCGCGCA 


AGGGCGTGGG 


CACTCACCGT 


96060 




CTTCCCAGCA 


TTGTCCAGGC 


CCAGGATCAG 


GATGCAGTAC 


TCGTCCTTCT 


GAAACATGTA 


96120 




CTTGTACAAG 


CCCGACAGCA 


GCGTGTACAT 


CCTGCCCTGG 


GCACCCCAAC 


ATAGGTCAGT 


96180 


30 


GTGCAGCCAG 


AAAGCACCTC 


CCCTCCCCCG 


GGCTTCTCCA 


CGGTGGTCAG 


TGGCGCCCCA 


96240 




CGTCCAGCCG 


ACCGCTCAGG 


ACGAGAGCCT 


GGGGGCCATT 


CCCGACTCCT 


CGTCCCTCTC 


96300 




CCACCCCGTC 


CCTCTGTAAC 


TTCTCCCAGG 


TCAGCCGCCA 


CTGTGTCCTG 


CTCACAGCAA 


96360 




TGACTGCGAC 


CTCTCCGCAT 


ACACATCGGT 


TCCGGCCCCT 


CCCCTGCTCG 


CGGGACTACC 


96420 




CAGCGGGGTG TTCACAGTGA GCTCAGCCGC GCTCCCGCCC 


TCCCCCGAGG 


CTTCGCTCCC 


96480 
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ACGCTTCACG CGCGCGGAAC 
CCTCGAGCGC CAGCCCCGCG 
AGCCCCTGCA GGCGCCGGGT 
CCTGGGGGGA GCGGACTACA 
5 TGGAGGCGGC GCGCGGCTGG 
CGGCTACGTC CGCCCCAAGC 
GCTGGCTCCT GTCACACCCG 
CCTCACCGCA CAGCCTGCGG 
ACCCCCGCTG ACCTCCGCTG 

10 CCGGCCTCAG GGAATGAGCT 
GCCCCCGCGC GCCAAGCACT 
GCTGGGGCGA AAAGGGGGCG 
GCTGCGGCTG GGAAACCGCG 
GCGGGGACGC GGCCCTGGCC 

15 GAAGGCGGCG GGCGCGGCGG 
GGAAGGGGCT CCGGAGTGAC 
GGAAGCGCGG AGCCTGAGCG 
TCTGAGCCGG TGTCTCCCCG 
CTGTGGGGTC GGTCCCGAGA 

20 AGCGCCTCCC TCTCTCAGCA 
TGCGCAGCTG CAGCAGGTGG 
TGACCTGCGC CAGCTGCAGG 
GGTGTCTGTC AGGAAGAGCA 
GGAAGATGCT GAGTACCAGG 

25 AGCGGCCCGT GGGTCCGGAT 
GGCAGGTGGG CAGGAGGAGG 
GAGCGCGCCC TACTACAGCT 
AACGGAAGAG GCGGAGGATG 
CAAGTCTCTG AAGCCGTGCC 

30 CAGGTAAAGC CCTTTGTTGT 
GTCGGGTCAG GATCGGCCCT 
CGTGCTTTGA CAGCTGTGTC 
GAAGCATGGA AGGAGGCCAG 
CCGAGGCGGG CAGATCACCT 



GGGGAACACA CTCGCTGCAG 
CCCCACCCGG GAGCAGCGAG 
AGGGACGCCC CATCACCCCA 
GGGCGGGTGA GCAGCGGCGC 
CAGGGAACGA CTGCGAAGGA 
CGCCGCCGCC AGGTCTGAGG 
CGTCCCACCT CGAGTCCTGG 
CCTACCTGCG TCCGCCGCGC 
ACCCCGCGCT AACCCCGCGC 
GAACCGCGTC CCAGCGGCCT 
TCCGGAAGCG GCGGCGCTCG 
CCGGGCCGCT CTAGCCGGTG 
CGGAGGAGGT GCCCGGCCGG 
AGACAGAAGA GACGCCGGGC 
GTGGGCTCGG CGGAGGGTGA 
GCGGGACCCG GCTAGCGGCG 
GGGGTACCCG GGCTGCGACC 
AGGGAAAGGG GACGTGCCCG 
CGCGGGGCTC AGCTGGCTTC 
TGGACGAGGA GAGCCTGGAG 
AGCTGGCCTT GGGCGCCGGC 
GGGACCTGAA GGAGCTCATC 
GGTTGTTGGC CGCGCTGGAC 
CTTTCCGGGA GGCCATCACT 
CAGAGACCGT TCCTAAAGCA 
AAGAGGGAGA GGACGAGGAA 
CCTGGGGCAC TCTGGAGTAT 
GCTCGGCGGG TGTCCGTGTG 
CGTTCTTCCT GGAGGGAAAG 
CAGATGCCAA CCTTAGGGGC 
CCCTTTGCTT TGCAGTTTTG 
TGTGTTCAGG CGTCTACGTG 
GCGCGGTGGC TCACGCCTGT 
GAGGTCAGGA GTTCAAGACC 



CCCCGCCTGG 


GCCACGGCAC 


95540 


GCAGCGGCGC 


GCTCCCCAGG 


96600 


TTTCTTAAAA 


CGGGGACGGC 


96660 


GGCTGCTCCT 


GGAGTGCACC 


96720 


AGAACCTGGG 


TCGCGGCCCC 


96780 


CTCCCCGACA 


AGCAGCCAAA 


96840 


GCCGCCCCTC 


GGGCCTCGCG 


96900 


CCTCGGAGCC ' 


GCTGCTGCTG 


96960 


GGCGCCTGAC 


GGGACGCGGG 


97020 


CCGCGCTCCG 


CTTCCCGGCT 


97080 


GGAGGAAGTG 


CCGATCGGCT 


97140 


AGGCCGGCGG 


GCTCTCTGTG 


97200 


GGACCAGGTG 


GCCGCGGTTT 


97260 


GGGGGGGCGC 


GGCCGGCCTG 


97320 


GGCGGCGGGG 


CGCCCCGCGG 


97380 


AGCCCACGGC 


GGCTCGGAAG 


97440 


TCTGCGCTGG 


GAGCTGTGCC 


97500 


TGCCCGTGCC 


CGCCCTCAGG 


97560 


TCTTCTTGCA 


GCCCTGGTCC 


97620 


TCGGCCTTGC 


AGACCTACCG 


97680 


CTGGATTCGT 


CTGAGCAGGC 


97740 


GAGCTCACCG 


AGGCCAGCCT 


97800 


GAAGAGCGCC 


CGGGCCGCCA 


97860 


GAGGCGGTGG 


AGGCACGAGC 


97920 


GAGGCGGGGC 


CAGAATCTGC 


97980 


GAGCTGAGTG 


GGACAAAGGT 


98040 


CACAACGCCA 


TGGTGGTGGG 


98100 


CTTTACCTGT 


ACCCCACTCA 


98160 


TGCCGCTTTA 


AGGAGAACTG 


98220 


GTAAGGGGCA 


CGCACACAGG 


98280 


TCTCAGCTTC 


CTGGGGCAGG 


98340 


TTCCTTCTGG 


GGTGAATCAA 


98400 


AATCCCAGCA 


CTTTAGGAAG 


98460 


ACGCTGGTCA 


ACATGGTGAA 


98520 
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ACCCCATCTC 


CTTAAAAACA 


CAAAAATGAA 


CCGGTCGTGG 


TGGCGCGCAC 


CTGTGGTCCT 


98580 




GGCTACTCAG 


GAGGCTGAGG 


CAGGAGAATT 


GGTTGAACCC 


AGGAGGCCGA 


GTTTGCAGTG 


98640 




AGTGGAGATG 


CAGCCACTGT 


ACTGCAGCCC 


GAGCAGCAGT 


GCAAGGCTTA 


TGTGGAAGAG 


98700 




AGTAGGTCTC 


CAGCCTATCG 


TCAGTTTTTT 


TTTGGTGGTT 


GTTTTAATTT 


TTTTTGAGAC 


98760 


5 


AGGGTCTTAC 


TTTGTCAACC 


AGGCTGGAGT 


GCAGTGGCAT 


AGTCCTGGCT 


CACTGCAGCC 


98820 




TGGACCTCCT 


GGGCTCAACC 


GATCCTCCTG 


CCTCAGCCCC 


CCTAGGAGCT 


GGGCTACAGA 


98880 




CTCACGCTAC 


TACACCCAGC 


TAATTTTTAT 


ATTACTATAA 


TTTTTTATCT 


TTTTTTTGAG 


98940 




ACGGAGTCTT 


GTTCTGTTGC 


CCAGGCTGGA 


GTGCAGTGGC 


GTGATCTCGG 


CTCACTGCAA 


99000 




GCTCCGCCTC 


CCGGGTTCAC 


GCCATTCTCC 


TGCCTCAGCC 


TCCCGAGTAG 


CTGGGACTAC 


99060 


10 


AGGCGCCCGC 


CACCATGTCT 


GGCTAATTTT 


CTGTATTTTT 


AGTAGAGACG 


GGGTTTCACC 


99120 




ATGTTAGCCA 


GGATGGTCTC 


AATCTCCTGA 


CCTCGTGATC 


CGCCCACCTT 


GGCCTCCCAA 


99180 




AGTGCTGGGA 


TGACAAGCGT 


GAGCCACCGC 


GCCTGGCCTT 


TTTTTTTTGG 


AGACAGAGTT 


99240 




TCACTCTCCT 


CACCCAGGCT 


GGAGTGTAGT 


GGCGCAATCT 


CAGCTTACCG 


CAACCTCTGT 


99300 




CTCCCGGGTT 


GAAGTAATTC 


TCTACCTCAG 


CGTCCAGAGT 


AGCTGGCATT 


ACAGGCGCCC 


99360 


15 


GCCACCACAC 


TCGGCTAATT 


TTTTGTATTT 


TTAGTAGAGT 


CGGAGATTCA. 


CCATCTTGGC 


99420 




CAGGCTGGTC 


TTGAACTCCT 


GACCTCGTGA 


TCCACCCACC 


TTGGCCTCCC 


AAAGTGCTGG 


99480 




GATCACAGGC 


GTGAGCCACT 


GCGCCTGGCC 


CTGTTGTTAG 


TTTTATTCTC 


TAGAGTTCAA 


99540 




CTTTTAAATT 


TTACTTTCAT 


GGAGATTTTC 


AAACATACCC 


CAAATTAGAG 


AGTTTAGCAT 


99600 




AATCACCGCC 


CACGGTCCAT 


CATCCAATGT 


CGTCATTTAT 


TAATATTTTC 


CCAGTCTCAT 


99660 


20 


TTTGTCTGTT 


CTCCCTGCCC 


TATTTTTTTC 


TTTCCTGGGC 


CATTTTAAAG 


CAAATTCCAG 


99720 




AAGTTACTGG 


TTTTTTCCAA 


TTATGAATAC 


TTCATAGTTG 


CATCTCTAAT 


CTAACTGATT 


99780 




AGGAAATTAC 


TTAAAAAGTA 


ACTTTTTGGA 


AGTCCAAGTC 


CGATGTGAGG 


ACAAAAAAGA 


99840 




GTAACTTCTG 


TGTCATAATA 


GGTAACACAT 


TTAATGGTAA 


TACCTCTTCC 


ATATTCAAAT 


99900 




ATGAACAATT 


ATTACTGTAA 


TGTCTCTATT 


TCCCTAAGCG 


CATAGCTTTA 


TTTTTCCTCC 


99960 


25 


TTTTTACTTT 


TCTCTTAGAA 


GAAATATTTA 


CCAAGCCTTC 


TAGTAGGTAA 




100020 




TAGCCAATAG 


TTCAGGCTGA 


CCGTGTAACC 


ATCCCTAGTT 


CTAGTTCTAG 


TTCTTTGAAT 


100080 




GTCTTCCTTT 




TTGAAACAGC 


GTCTTGCTGC 


TCTGTCACCC 


AGGCTGGAGT 


100140 




GCAGTGGCAC 


AATCTCGGCT 


CACTGCAATC 


TCCGCCTCCC 


TGGCCCAAGC 


CATCCTCCCA 


100200 




CCTCAGCCTC 


CCTAATAGCT 


GATACTACAA 


GTGTGCACTG 


CCACGCCCAG 


CTAATTTTTG 


100260 


30 


TATTTTTTGT 


AGAGACGGGA 


TTTCACCATA 


TTACCCAGGT 


CTCGAATTCC 


TGATCCCTTT 


100320 




GATGAGAGAT 


CTGACACATC 


CCTGTGGTGC 


TCCCTCTGGA 


CCAGGCACTG 


CTCCAAGGGT 


100380 




TTCATATACT 


TTCATTCATC 


TGTGCAACAG 


CCCTGTAGGT 


AGGCCCTGCA 


GTCACACCAT 


100440 




CTGACAGAGG 


AGGAAACAGG 


AGTAGAAGAA 


CTGAGTGGTC 


CAGGGCTTCA 


AGGCTCAGAG 


100500 




GGCTCCAGTT 


GCCCCCAGCC 


CTCGTTCCGT 


CCCCTGCTCC 


ACCCAGTGCT 


GCTTGCCATG 


100560 
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TCGGCATCAG 


GCCTGATCTG 


AAAGCTTCCG 


GAGCATCTTA 


CAGACGTCCA 


CCTTGCCACC 


100620 




ATTCAGGACT 


GATAAGTTCT 


CTTGGATTTG 


CGTTGGACCT 


TTTTTTTTTT 


TTTAAGATGG 


100680 




AGTTTCACTG 


TTGTTGCCCA 


GGCTAGAGTA 


CAATGGCACG 


ACCTCCACCT 


CCTGGGTTCA 


100740 




AGGGATTCTC 


CTGCCTCAGC 


CTCCCAAGTA 


GCTGGGATTA 


CAGGCGCCTG 


TCACCACGTG 


100800 


5 


GTGCCCAGCT 


AATTTTTATA 


TTTTTAGTAG 


AGGCAGGGTT 


TCACCGTGTT 


GGCCAGGCTG 


100860 




GTCTCGAACC 


CTTGACCTCA 


GGTGATCCCG 


CCTTGGTTTC 


CCAAAGTGCT 


GGGATTACAG 


100920 




GCATGAGCCA 


CCACACCCGG 


CCCAGGATTT 


CTTTATATAT 


TCTGGATATC 


ATCCCTTATG 


100980 




AAGTATATAG 


TTTGCAGATA 


TTTGCTCCCA 


TTGTTTGGGT 


TGTCTTTTCA 


CTTGATATAG 


101040 




TGTCCTTTGA 


TGCACAAACA 


TTTTAAATTT 


TGATGCAGTG 


CAATTTATTG 


TTTCTTTATT 


101100 


10 


GCCTATGTTT 


TTGTCATCAG 


GTTTAAGAAA 


CCACCTCATC 


CATAGTTATG 


AGGATTTTCA 


101160 




CCTATGTTTT 


CTTCTAAGAG 


TTCTGTAGTT 


TTAGCTGTTA 


AATTTAGGTC 


TTTGATCCAT 


101220 




TTTGAGTTAA 


TrrTTGTATA 


TGTTATTAGG 


TGAGGGTCCA 


CTTTATTCTT 


TTGCATGTGG 


101280 




ATTTCCAGTT 


TTCCCAGCAC 


CATTTGTTTA 


AAAGACTGCT 


TTTTCTCCAC 


TGAATGGTCT 


101340 




TGGCACTTTT 


GTCCAAAATC 


AATTGGCAAT 


ATATGTAAGG 


GTTTATTTCT 


GAGCTCTCTC 


101400 


15 


TCCTGTTCCA 


TTGGTGTATA 


TGTGCCAGTA 


CCACACTGTT 


CTGATTATTA 


TAGCTTTGTG 


101460 




ATAAGTTTTA 


AACTCAGGAA 


GTGGTAGTTA 


TTCACCATTT 


GCTCCTCTTT 


TTCAAGTTTG 


101520 




TTTTGTTTCT 


GGATCCTTTG 


CAATTTCATA 


TGAATTTTAG 


GATCGGCTTG 


TCCAATTCTG 


101580 




CATAAAAGAC 


AGTTTGAATT 


TTGATATGGA 


TTGCATAGAA 


TGTGTAGATC 


TGTTTGGGGC 


101640 




ACATTGTCAT 


CTTTACAATA 


TTAAGCCTTC 


TGGCTGGGTG 


TGGTGGCTGA 


CGCCTGTAAT 


101700 


20 


CCCAGTACTT 


TGGGAGGCTG 


AGGCGGGCAT 


ATCACTTGAG 


GTCAGGAGTT 


CAAGACCAGC 


101760 




CTGGCCAACG 


TGGTGAAACC 


CCGTCTCTAC 


TAAAAATAAA 


AAACAAATTA 


GTCGGAGGTG 


101820 




GTGCACACCT 


GTAATCCCAG 


CTACAGGAGA 


GGGTGAGGCA 


GGAGAATCGC 


TTGAACCTGG 


101880 




GAGGAGGAGG 


TTGCAGTGAG 


CTGAGATCAT 


GCCACTGCAC 


TCCAGCCTGG 


GTAACAGAGG 


101940 




GAGACTCCAT 


CTTAAACAAC 


AACAATAACA 


GAAGAAAAAA 


ACAGTATTAA 


GTCTTCCAAT 


102000 


25 


TCATGAATGA 


AGGATCTGTC 


CATTTATTTA 


CGTCTTTAAT 


TTCTTTCAAC 


AGTATTTTGT 


102060 




ACTGTTCAAG 


TCTTGCACAT 


TCTTGGTTAA 


ATAAGTATTA 


TTTTTGATGC 


TTCTCTAAGG 


102120 




AATTGTTTTT 


CTTTTCCTTT 


TTTTTTTTGA 


GACAGAGTCT 


TGCTCTGTCA 


CCCAGGCTGG 


102180 




AGTGCAGTGG 


CACAATCTTG 


GCTCACTGCA 


ACCTCTGCCT 


CCCGGGTTCA 


AGCAATTCTT 


102240 




CTGCTCAGCC 


TCCCAAGTAG 


CTGGGATCAC 


AGGTGCCTGC 


CACCACACCC 


AGCTAATTTT 


102300 


30 


TTTTTTTGAG 


ATGGAGTCTT 


GCTCTGTTGC 


CCAGGCTGGA 


GTGAAGTGGC 


CCAATCTTGG 


102360 




CTCACTGCAA 


GCTCCACCTC 


CCGGGTTCAC 


ACCATTCTTC 


CGCCTCAGCC 


TCCTGAGTCG 


102420 




CTGGGAATAC 


AGGTGCCTGC 


CACCACGCCC 


AGCTAATTTT 


TTGTATTTTT 


AGTAGAGATG 


102480 




GGGTTTCACC 


ATGTAGCCAG 


GATGGTCTCG 


AACTCTTGAC 


CTCAGGTGAT 


CTGCCTGCCT 


102540 




CGGCCTCCCA 


AAGTGCTGGG 


ATTACAGATG 


TGAGCCACTG 


TGCCCGGCTC 


GAGTTGTTTT 


102600 
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CCTTAGTTAC ATTTTCAGGC TGTTTGTTGC 
GGCTCACGCC TGTAATCCCA GCACTTTGGG 
GGAGTTCGAG ACCAGCCTGG CCAACATGGT 
TTAGTCTGGC ATGGTGGCAG GTGCCTGTAA 
5 AATTGCTTGA ACCTGGGAGG CGGAGGTTGC 
GCTTGGGGAA CAAGAGTGAG ACTTCATCTC 
TTTTTATGTT AATCCTGTAT TGATTGCTGA 
ATGCACTATT TAGGATTTTC GATATATACA 
TATATGTGAA TAGAGATAAT TGTAGTCTTT 
10 . TTCTTGCTTA ACTGCCTTAG CTAGAACTTC 
CGGGCCGGGG GTGGTGGCTC ACACCTGTGT 
GATCACTTGA GATCAGCAGT TTGAGACCAG 
CTAAAAATAC AAAAATTAGC TGGGTGAGGT 
AGGGTGAGAC ATGAGAATTG CTTGAACCTG 
15 TGCCACTCCA CTCCAGCCTG GACGACAGAG 
AAAAGTGGCT AGAACAAACA TCTTTATCTT 
TTCCTGATAT TAGGTGGAAA ACTTTTAGTC 
GTTTTCTGTA GATTCCCTTT ATCGAGTTGA 
GTGTTTTTTA TCATGAAAGG GTGTTGATTT 
20 CCCAGGCTGG AGGGCAGTGG CATGATCATG 
GGGGATCCTC CTACTTCATC CTCCTGAGTA 
CAGCTAATTT TTTAATTTTT CTGTAGAGGT 
TAAACTCCAG GGCTCAAGCA ATCCTGCCTC 
GAGTCACTGC ACTGCACCCA GCTGTGTGGG 
25 TCATGTGTGG TTTTTTTCCT TTCATTTTGT 
ACCATCCTTG AATTCCTCAG ATAAAGCACG 
TATTTTTTTT GTAGAGATGA GATTTCACTC 
CTAAAGTGAT CCTCCTGCCT CAGCCTCCGA 
GGCCCTATCT TTTTTCTTTT TCTTTTTTTT 
30 CCGGGCTGGA GTGAGTGGCG CGATCTCGGC 
CAATTCTCCT GCCTCAGCCT CCTGAGTAGC 
GCTAATTTTT TGTGTTTTTA GTTGAGACGG 
CACTCCTGAC CTCGTGATTC ACCCACCTTG 
AGCCACCGCA GCGAGCCTTA TCTTTTTAAC 



TAGTATATAG AAATACAAGC TGGGCACCGT 102660 
AGGCCAAGGC GGGTGGATCA CCTGTGGTCA 102720 
GAAATCCAGC CTCTATTAAA AATACAAAAA 102780 
TCCCATCTAC TCAGGAGGCT GAGGCAAGAG 102840 
AGTGAGCTGA GATCGCGCCA TTGCACTCCA 102900 
AAAAAAAAAA AAAAAGAAAT ACAGTGGATT 102960 
ATTGGTTTAT TAGTGCTAAT AGGATTTTTT 103020 
ATCATATATA TTCAATATAT ACAATTAATA 103080 
GTTTCTAGTT TGCATGGCAT TTATTTCTTT 103140 
AAGTACGATG TTGAATAAAA GTGACTAGAG 103200 
TCCCAGCACT TTGGGAGGTG GAAGTGGGCA 103260 
CCTGGCCAAC ACGGCGAAAC CCCATCTCTA 103320 
GATGTGCACC TGTAGTCCCA GCTACTTGAG 1033 80 
GGGGGCGGAG GTTGCAGTGA GCCAAGATCA 103440 
CAAGAACCCT GTCTTTAAAA AAAAAAAAAA 103500 
GTTCCTGATC TTAGGTGGAA AACTTTTTTG 103560 
TTTCACTGTT GAATATGATG TTACTTGTAG 103620 
GGAAATTCTC TTATATTCAT AGTGTGTTGA 103680 
TTTTTTTAAA GATAGGGTCT TGTTCTGTCA 103740 
GCTCACTGCA ACCTCGAATT CCTGGGCTCA 103800 
GGTGAGACTA CAGGCATGAG CCACCATGCC 103860 
AGGGTCCTGC TTTGCTGCCC AGGCTGGTCT 103920 
AGCCTCCCAA AGTGCTGAGA TTACAGGGGT 103980 
ATTTTTCAAA TGCTTTTTTC CTTTAGATGA 104040 
TAATGTGGTA TATTGATTTT CGTATGTTGA 104100 
CATATTCATG GCGTATTATC TCTTTATTAT 104160 
TGTTGCCCAA GCTGGTCTCA AACTCCTGGG 104220 
AAGCGCTGGG ATTATAGGCA TGAGCCACTT 104280 
TTTTTTTTGA GACAGAGTCT CACTCTGTCG 104340 
TCACTGCAAC CTCCATCTCC CGGGTTCAAG 104400 
TGGGACTACA GGTGCCCGCC ACTATGCCCA 104460 
TGTTTTGCCA TGTTGGACAG GCTGGTCTTG 104520 
GCCTCCCGAA GTGCTGGGAT TACAGGCATG 104580 
AGTTAAAAGT TTAAGGCCTT ATCATGTAAT 104640 
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AACATTGCTG GATTTGATTT 
GGGATATTGG TCTGTAGTTT 
TAATATTGGC CTCATAGAAT 
AGGGATTGGT GTTAATTCTT 
5 ACAATCATAT ATATAGAGAG 
ATTGACTATT GATTCAATTT 
ATAACTGTAG TGAAAGTTTC 
TTTATTTGTT ATAGGTCTAT 
CTGTCGCCCA GGGTGGAGTG 

10 AGTTCAAGTG ATTCTCCCAC 
CATACCTGGC TAATTTTTGT 
GGTCTCGAAC TCCTGACCTC 
ACAGACATAA GCCACCGCGT 
TAGGAATTTG TCCATTTTGT 

15 TCCTTTTATA GCCCTAGTTA 
GGTTTTAGCA ATTTGAGTCA 
CACTGTGTCA CCCAGGCTGG 
CCCAGGTTCA AGCAATTCTG 
CCACACCCAG CTAGTTTTGT 

20 CGAGGCTGGA GTGCAGTGGT 
CAATTCTCCT GCCTCAGCCT 
GCTGATTTTT ATATTTTTAG 
ACTCCCTACC TGAGGTGATC 
GAGCCACCAT GCCCAGCCAG 

25 CCAGGCTGGT CTTGAAATCC 
TAGGATTACA GGCATGAGCC 
AAAGGTTTGT CAATTTTGTT 
TTTCATTTAT TTCCATTTTA 
CTCTTTTTTT CCCCTAATTT 

30 TTTCTTTTCT XTTTTTTTTT 
TGTCACCCAG ACTGGAGTGC 
GCGATTCTCC TGCCTCAGCC 
ATGTAAGCAT TTACAGCTAC 
ATTGTTTTTT GTTGTTTTTT 



GCTGCTGTTT TGTTGAGAAT 
TCTTTTCTTG GCATGTCTTT 
GAGTTAGGAA GTATTCTTTA 
CTTCAAATGT TTGATAGAAT 
AGAGAGAGAG AGAGATGGAC 
CCTTATTGAA ATTGACTTTT 
TGAACTTTTC TTTCATTGGA 
TCAGATTTTC TGTTTCTTCT 
CAGTGGTGCC ATCTTGGCTC 
CTCAGCCTCC CCAGTATCTC 
ATTTTTAGTA GGAACAGCAT 
AGGTGATCCA CCCGCCTCGG 
CCAGCCTTGA GTCAGTTTAG 
TTATGTTATC TAATCTGTTA 
TTTCTGTAAG ATCAGTAGTA 
TCTCTTTTCT TCTTCTTTTT 
AGTGCAGTGG CATGATCTTG 
CCTTAGCCTC CTGAGTAGCT 
TTTGTTTTTT TGTTTTTGAG 
GCAATCTCAC TCATTGCAAC 
CCCGAGTAGC TGGAACTATA 
TAGAGATGGG ATTTCACCAT 
CGCCCACCTT GGCCTCCCAA 
TTTTTGTATT TTTAGTAGAG 
TGACCTCAGG TTATCCACCA 
ACCACGCATG GCCTGTCTTT 
GATCTTTTTT GTTGCTGATC 
ACCTTTGTTT CCTTTTTTCT 
TTCAAGGTAT ACAGTTAAGT 
TTTTTTTTTT TTTGGTTGCT 
AGTGGCATGA TCTCAGCTCA 
TCCTGAGTAG ACGTTTCCCG 
AGATTT'CCCT CTAAACACTG 
GTTGTTGTTT TGTTGTTTGA 



ATTTGCATCT 


GTATTGATAA 


104700 


GTATAGCTTT 


GATGCCAGCA 


104760 


TATTATGGGA 


AGAGGTAAAA 


104820 


TCAACAGTGA 


AGTGATATAT 


.104880 


TTTTCTTTTG 


TTGGAAGTTT 


104940 


CTTTTTGGAA 


GCTAAAATGT 


105000 


AGTTTTTTGA 


CTACTGATTC 


105060 


TGAGTCAGTT 


TGGTCTCGCT 


105120 


ACTGCAACTT 


CTACCTCCCG 


105180 


GGACTACAGG 


CGCACGCCAG 


105240 


TTCACCATGT 


TGGCCAGGCT 


105300 


CCTCACAAAG 


TGCTGGGACT 


105360 


ATAGTTTGCA 


TGCATGTTTC 


105420 


CCATACAATT 


GTTCATAGTA 


105480 


ATAGCTCCAC 


TTTCTCTCTT 


105540 


TTTTTTTTGA 


GATGGAGTCT 


105600 


GCTCACTGCA 


ACCCCTGCCT 


105660 


GGGATTACAG 


GTGTGAGCCA 


105720 


ACGGAGTCTG 


TTTCTGTCTC 


105780 


CTCCGACTCC 


CAGATTCCAG 


105840 


GGCGTGCACC 


ACCACGCCTG 


105900 


GTTGGCCAGG 


CTGGTCTTGG 


105960 


AGTGCTGGGA 


TTATAGGCAT 


106020 


ATGGGGTTTC 


TCCCTGTCGG 


106080 


GCCTTGGCCT 


CCCAAAGTGC 


106140 


TCTTCTTGGT 


CATTTTCGCT 


106200 


TCTATTGTTT 


TCCCATTCTG 


.106260 


GCTGGTTTGG 


GTTTAATTTG 


106320 


TATTGATTTG 


AGATCTCTTT 


106380 


GTTGAGATGG 


AGTCTCCCTC 


106440 


CTGCAGCCTC 


CGCCGCCCAG 


106500 


GCCAAGGTGT 


TTCTTTTTGA 


106560 


CTTTCACTGC 


ATTCCATAAG 


106620 


GACACAGTCT 


CACTCTGTTG 


106680 
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CCGTTTGGAG AGCAGCGATG CGATCATAGC 
GTCCTCCTGC CTCAGCCTCC CAAGTAGCTG 
TAATTTCTTT TATAAGTTTT TGCAGAGGCC 
CACTTTGGGA GGCCAAGGTG GGTGGATCAC 
5 CGACAGGGAG AAACCCCATC TCTACTAAAA 
TGCCTGTAAT CCCAGCTACT CAGGAGGCTG 
AGAGGTTGCA GTGAGCCAGG ATCACACCAT 
CTCCATCTCA AGAAAAGAAA AAAAAAAGTT 
: CAGGCTGGTC TCAAACTCCT GACTTGAAGG 
10 , GAGATTATGG GCAAGAGCCA CCGCACCCTG 
CATTTTCATT GATCTCAAGA CATCCTAATC 
TTATTCAAGA GTGTCTTTAT TTCTGCATAT 
TTTTTTTTTT GAGAAAGGGT CTTGCTCTGT 
CTTGCCTCAC TGCAACCTCT GCCTCCCGGG 
15 AGTAGCTGGG ATTACAGGCA CACACCACCA 
ACGTGCTGGT. CAGACTGGTC TCGAATTCCT 
CCAAAGCACT GGGATTACAG GCGTGAAACA 
ATAGAGAGAA GGTCTCACTC AAGCCCAGGC 
TCCCTCCTCG GCCTCCCAAG GTGCTGAGAT 
20 TTTCAAGACT TTAAAAATGC CATCTTGGCT 
CACTTTGGGA GGCCGAGGTG GGCAGATCAC 
ACATGGTGAA ACCCTGTCTC TACTAAAAAT 
GCCTGTAGTC CCAGCTACTC GGGAAGCTGA 
GAGCTTGCAG TGAGCTGAGA TCACACCACT 
25 CCGTCTCAAA AAAAAAAAAA AAAATGTCAT 
TGAGAAATTG GCTGTTAATC TTATTGAGGA 
TTGCTGTTTT AGGAGATTCT CTATCTTTGG 
GTGGATCCCT CAATTTATAA GCTACTTGGA 
ATGTCTTTCA TTAAATTTGC AAAGTTTCAG 
30 TTTGTTTCTT TCTGTCTGTT TGCCGCTTAT 
TGGTGTAGCA CAGGTCCCTT GGGCTCTAGG 
TCCTCATTTT GGATAAATTC AGCTGACCTG 
TTCTCAAATC TGCTGTTGAA ACTTCTGGTG 
GCTCCAAAGT TTCTATTTGG TTTCTTTCTG 



TCTGTAGCCT TGAGCTCCTG GACTCAATCA 106740- 
GGACTACAGG TGTACACCAC TGCACCTAAC 106800 
AGGCACAGTG GCTCACACCT GTAATCCCAG 106860 
CTAAGGTCAG GAGTTCGAGA CCAGCCTGGC 10692 0 
ATACAAAAAT TAGCTGGGCG TGGTGGCAGG 105980 
AGGCAGGAGA ATCGCTTGAA CCTGGGAGGC 107040 
TGCACTCCAG CCTGGGTAAC AAAAGCAAAA 107100 
TTTGCAGAGA CAGGGTATCA CTTTGTTGCC 107160 
AGTCCTACTG CCTCAGCCTC CCAAAGTGCT 107220 
CCACTTGGCT GTTTTGTTCT GTTGTATTTC 107280 
TCCCTTTTGT TTTTTTGTTC GACTTACTGG 107340 
TTGTAAATTT TCCAAAAAAG TTTTTCTTTC 107400 
CGCCCAGGCT GGAGAATGGT GGTGCACAAT 107460 
TTCAAGTGAT CCTCCCACCT CAGCCTTCCC 107520 
CACCTGGCTA ATTTTTGTAT TTTAGTCTTA 107580 
GACCTCAGGT GATCTGCCCG CCTTGGCCTC, 107640 
CCATGCCCAG CCCCCAATTT TTTTTTTTTA 107700 
TGGTCTTGAA CTCCTGAGCT CAAGCTGTCA 107760 
TACAGGTGTG AGTCACAGTA CCTGGCCTTC 107820 
GGGCACGGTG GCTCACGCCT GTAATCCCAG 107880 
GAGGTCAGGA GATCAAGACC ACCCTGGCTA 107940 
ACAAAAAATT AACCAGGTGT GGTGGCAGGT 108000 
AGCAGGAGAA TGGCGTGAAC CCGGGAGGTG 108060 
GTACTCCAGC CTGGGCAACA GTGCGAGACT 108120 
CTCACTGCCT TCTGGTCCAA TAGTTTCTGA 108180 
ACATTTATAT ATTGACTAGT CACTTGTCTC 108240 
GTTTCAGCAG TTTGATTATA ATGTATCAGT 108300 
GTTCATTGGA CTTCTTGGAT GTGTAAATTC 1083 60 
CTACTATTCT TTGCATCTTG AAATACTAGT 108420 
GGAACTTTAT GCATACATTG ATGTGCTTCA 108480 
CATTTTTCTT TGTTCTTTTT TTCTTTCTGC 108540 
TCCTCAAGTT CACTGTTTCT TTCTTCTTCC 108600 
AAATTTTCAC TACAGTTACT GTACTTTTTA 108660 
TAGTAATTAT CACTTTACTA GTATTCTCTA 108720 
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TTTGGTTAGA CATGGTTCTT 
TTAAATTTCT TTTTATTTAG 
GTCACCCAGG CTGGCAGGCA 
TTCAAGTGAT TCTCCTGCCT 
5 CACCCACCTA ATTTTTGGTA 
GTCTTAAACT ACTAACCTCA 
CAGATGTGAG CCACTGTGCC 
GTTTTTGTCT AGTCATTCAA 
TATTTTTAAA AAATTTTTTT 

10 CTAGGCTGGA GTGCAATGGC 
GCGATTCTCC TGCTTCAGCC 
GGCTAATTTT TTGTATTTTC 
GATCTCGTGA CCTCATGATC 
GAGCCACCGC GCCCAGCCTA 

15 GCTGTAGTGC AGTGGCACAA 
GTCTCCCACC TTAGTCTCCT 
GGTTGTGTTG TTACTGTTTT 
CTCCTGGGCT CAAGCAGTCA 
AGCCACCGCA CCCGGCCTGT 

20 TTCTTTGGAT GCCTTGTAAT 
TTACTTTGAA AATCAGATCC 
TTGTCGTTTC TCGTTTGTTT 
TCAGGGGTGC TTGTTTCTGT 
TCTTTAAATA TCTAGTGGCA 

25 ATGCTAGGAC TTTGGGAAGT 
GCCTGGCCAG TATGGTGAAA 
TGGCACCTGC CTGTAGTCCC 
AGGGGGCGGA GGTTGCAGTG 
GCGAGACTCT GTCTCAAATA 

30 CCAGGTTGCC TCTGTACTGG 
CCTCAGCCTT TACTTCTCGC 
GGCCTCCTCA GGTGTTTCTG 
TTTCCAAGCA TGAACAGGAG 
GCCTCTTCCT TCCTAGGCTT 



TTGTTTTCCT TTAGTTCATT 
TTATTAATTT tTTTTTTTTT 
ACGTCACAAT CTTGGCTCAC 
CAGCCTCCCA AGTAGCTGGG 
TTTTTAGTAG AAACTGGGTT 
GGTGATCTGT CCGCCTCAGC 
CAGCCTCTTT TTTTAGTGTA 
ATGTCTAGGC TTCCTCAGGA 
TAATTTTCTT TTTTTTTTAG 
TTGATCTTGG CTCACTGCAA 
TCCTGAGTAG CTGGGACTAT 
AGTAGAGACA TGGTTTTGCC 
CTCCTGCCTC GGCCTCCCAA 

.prjvprp/jtrpp^-prp^ TTTGAGACAA 

CCCTGGCACA CTGCAGCCTT 
GAGTAGCTAG AACTACAAGC 
AGACACAGGG TCTTGCTACA 
TCCCACCTTG GCCTCCCAAG 
TAATTTCTTT ATTTCCGGTG 
TTTTTGTTGA AACCTGCACA 
TCCGCCCTCT GCAGGGTTCA 
AGTTACTTTC CTGACCTTTT 
TCTTTTAGGT TAGTGGTTAG 
AAAAGGATAA AGAGGCCGGG 
GGAGGCGGGT GGATCACTTG 
CCCTGTCTCT ACTAAAAATA 
AGCTACTGGG AAGACTGAGG 
AGCTGAGATT GCGCCATTGC 
AAAAAAAAAA AAAAAGGATA 
GGCAAGTCCT TCAGTGTCCG 
TCCCATGGAG CCTAAGGATG 
AGCCCCTGTC TAGCCCCAGC 
CTTTCCAAAG CCCTTAGACC 
TTCTGTCAGC TCTTTGCCCA 



ATCCATGGTT 


TCCTTTATTT 


108780 


TGAAGCGGGG 


TTTCACTCTT 


108840 


TACAACCTCC 


GCCTCCTGGG 


108900 


ATTATAGGCA 


TGTGCCACCA 


108960 


TCACCACATT 


GGCCAGACTG 


109020 


CTCCCAAAAT 


GCTGGGATTA 


109080 


TTTAAGGTAA 


TTGATTGAAA 


109140 


ACAGTTTCTA 


TTAATTTCTT 


109200 


ATGGAGTCTC 


ACTCTATAGC 


109260 


CCTCTGCCTC 


CTGGGTTCAA 


109320 


AGGTGCGTGC 


CACCACTCCT 


109380 


GTGTTAGCCA 


GGATGGTCTC 


109440 


AGTGCTGGAA 


TTACAGGTGT 


109500 


AGTCTCCCTC 


TCTCACCCAG 


109560 


AACCGTCCAG 


GCTTAAGTGA 


109620 


ATGTGCCACC 


ATGCCTGGCT 


109680 


TTTCTCTGAC 


TGGTCTTGAA 


109740 


GTGTTGAGAT 


TACAGGTGTG 


109800 


AATGGGCCAC 


ACTTTCTTGT 


109860 


ATTTGAAGAT 


GATAATGTGG 


109920 


TTGTTGCTGT 


TTGTTGTGGA 


109980 


TAAATAAAGA 


CTATATTCTG 


110040 


CTTGTGCTTT 


GAAAGAGATT 


110100 


CGCAGTGGCT 


CACGCCTGTA 


110160 


AGGTCAGGAG 


TTTAAGATCA 


110220 


CAAAAATTAA 


CCGGGCATGG 


110280 


CAGGAGAATC 


GCTTCAATCC 


110340 


ACTCCAGCCT 


GGGCAACAGA 


110400 


AAGAGTGTCT 


TCCATCCTTT 


110460 


CCAGGCTGTT 


CACGGCTTTT 


110520 


AACCAGAGGT 


GAAAGTTGAG 


110580 


TGTGTGCATG 


GCCTTCTGGA 


110640 


TTCATGTAGC 


TCTTTTCCCA 


110700 


TCTGTTGTTG 


TCCCTCCCCC 


110760 
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ACAACTTCAG GTAGTATCTA CCTGTAAATG 
GTTATCGCAG CACTTTGGGA GGCCGAGGCG 
CCAGCCTGGC CAACATGGTG AAGCCCCGTC 
TGGTGGGTGC" CTGTAATCTC AGCTACTCGG 
5 GGGAGGCGGA GGTTGCAGTG AGCTGAGATC 
GTGAGACTCC ATCTCGGGGA AAAAAAAAAA 
AGGCTGCCCC AGCCCTGAGA GAGTTCGAGG 
ACACTAGAAA GATCCAAATG CATAAGCAGG 
TCTGACACCA GCAAGCCACA TCAGAAATAC 

10 TGGTAGTAGT GGCTGAGCAG AAATAGCCCA 
CTTCATTGAG CAGTCATCTG GTTCGTAGAC 
TTTTGACAGT TTAATAGAAA AAAGTTTATT 
GTTTACTGTG ACAGTTTTGA CAGCAGAATA 
GCTGCCAACT CCATCATTTT GGTGATATCC 

15 TTTTAAGTTA TTTTCTTAGT GGTTGCTCTA 
CAATGTACTT CAGATTATTA CTAACTTAAC 
AGTTTCACTC TGTCACCCAG GCTGGAGTGC 
CGCCTCCCAG GTTCACGCCA TTCTCCTGCC 
ACCCCCACCA CACCCGGCTA ATTTTGTATT 

2G TCAGGCTGGT CTCGAACTGC TGACCTCAGG 
TGGGATTACA GGTGTGAGCG ACTGCACTGA 
GATCTACTTC CGTTGTACTA TGACATAGTT 
ACCTCCCTTT TCCTCTGACA TAGTTCCATC 
CTCTGGCATA GCTCCATCCT CCCTTCTCCT 

25 GACACAGCTC CATCCTCCCT TCTCCTATGA 
TAGCTCCATC CTCCCTTCTC CTATGTCATA 
TCCATCCTCC CTTCTCCTCT GGCATAGCTC 
TCCTCCCTTC TCCTATGACA CAGCTCCATC 
CCCTTCTCCT ATGACACAGC TCCATCCTCC 

30 TCTCCTCTGA CATAGCTCCA TCCTCCCTTC 
CTCTGACATA GCTCCATCCT CCCTTCTCCT 
GACATAGCTC CATCCTCCCT TCTCCTCTGA 
TAGCTCCATC CTCCCTTCTC CTCTGACATA 
TTGCCATATA TACATTTATG TATGTTATAA 
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CCTTCAGGCC 


AGGCGCGGTG 


GCTCATACCT 


110820 


GGTGAATTGC 


TTGAGGTCAG 


GAGTTCGAGA 


110880 


TCTAGTAAAA 


ATACAAAAAT 


TAGCTGGGCG 


110940 


GAGGCTGAAG 


CAGGAGAATT 


GCTTGAGCCT 


111000 


GTGCCATTGC 


ACTCCAGCCT 


GGGCGACAGA 


111060 


AAAAAAATGC 


CATCAACAGC 


ACGACCCTGG 


111120 


GGGTGAAACA 


AAGGCAAGCC 


CTTCAGGGAG 


111180 


ATTCCTTGAG 


AAAAGGTCTG 


TATCATGCCT 


111240 


AGGTTGCCTT 


CCCCATGGCT 


ACATGTGAGC 


111300 


GCTGTCCTCC 


TGAAATTTAG 


CAGGGTCTTA 


111360 


ACCAGAGTTA 


CAGAAAAGTT 


TATTGGGAGG 


111420 


GTGACAGTTT 


TGACAGCTGA 


ATAGAAAAAA 


111480 


GTTGCTTTGC 


TGGAGAGACG 


GATCTTTGGA 


111540 


AGCTCTGTTG 


CTGAATTTTT 


AGCTATGCTG 


111600 


GAGATGACAA 


TGTGCATCTT 


TAACTTACCA 


111660 


ACTTAAAGTA 


CAGCATTTTT 


TTTTTTATGG 


111720 


AATGGTGTGA 


TCTCGGCTCA 


CTGCAACCTC 


111780 


TCAGCCTCCT 


GAGTAGCTGG 


GACTACAGGC 


111840 


TTTAGTAGAG 


ATGAGGTTTC 


ACCATGTTGG 


111900 


TGATCCGCCC 


ATCTTGGCCT 


CCCAAAGTGC 


111960 


GCCTAAGTAT 


GGCAACGTGT 


CTATAACATA 


112020 


CCCCCTCCAT 


TTTCCTATAG 


CACAGTCCCA 


112080 


CTCCCTCCTC 


CTATGACGTC 


CTCCCTTCTC 


112140 


ATGACACAGC 


TCCATCCTCC 


CTTCTCCTCT 


112200 


CACAGCTCCA 


TCCTCCCTTC 


TCCTCTGACA 


112260 


GCTCCATCCT 


CCCTTCTCCT 


CTGACACAGC 


112320 


CATCCTCCCT 


TCTCCTATGA 


CACAGCTCCA 


112380 


CTCCCTTCTC 


CTATGACACA 


GCTCCATCCT 


112440 


CTTCTCCTCT 


GGCATAGCTC 


CATCCTCCCT 


112500 


TCCTCTGACA 


TAGCTCCATC 


CTCCCTTCTC 


112560 


CTGACATAGC 


TCCATCCTCC 


CTTCTCCTCT 


112620 


CATAGTTCCA 


TCCTCCCTTG 


TCCTCTGACA 


112680 


GCTCCATCCC 


CTCTTCTCCT 


TCATGTATTA 


112740 


CTTCAGCTCT 


TCAGCGTTAT 


AATTATTGCT 


112800 
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TCAAAAGTAT 


TTTGAAAGAA 


GTTGCCTGGA 


GGCAGTGGCT 


TATGCCTTTA 


ACTCCAGCAC 


112860 




TTTTGGGGGC 


TGAGGTGGGC. 


AGATCGCCTG 


AGCCAGGGAG 


TTGGAGACCA 


GCCTGGGCAA 


112920 




CATGACGAAA 


CCCATCTCCA 


CCAAAATTAC 


AAAAAATTAG 


TCTGGCATGG 


TGGCACGCGC 


112980 




CTGTAGTCCC 


AGCTATTTGG 


GGGAGGATCC 


CAGCTAAGGT 


GGGAGGATCA 


CTTGAGCCTG 


113040 


5 


GGAAGTCAAG 


GCTGCAGTGA 


GCTGAGATTG 


TGCCACTGCA 


CTCCAGCCTG 


GGTGCAGATC 


113100 




TTATCTCAGA 


AGTAAAGGGA 


CTAGGAATGG 


TGGCTTTTAT 


CTCTAATCCC 


AGCACTTTGG 


113160 




GAGGCTGAGG 


TGAGTGGATC. 


ACCGGAGGTC 


AGGAGTTTAA 


GACCAGCCTG 


GGCAACATGG 


113220 




TGAAACCCCG 


TCTCTACTAA 


AAATACAAAA 


AGTAGCCGGG 


TGTGGTGGTG 


GGTGTCTGTA 


113280 




ATCCCAGCTA 


CTCGGGAGGC 


TGAGGCAAGA 


GAATCGCTTG- 


AACCTGGGAA 


GCGGAGGTTG 


113340 


10 


CAGTGAGCAA 


GATCGCACCA 


CTGCATTACA 


GCCTAGATGA 


CAGAGCGAGA 


CTGTGCCTAA 


113400 




AAAAAAAAAA 


AAAAAGAAAA 


GAAAAGAAAT 


TAAGATCTAG 


ACACTGTGGT 


TCATGCCTGT 


113460 




AATCCCAAAG 


CCTTGGGAGG 


CCAAGGCAGG 


AGGATCACTT 


GAGGCCAGGA 


GTTGAACACC 


113520 




AGCCTGGGCA 


ACATAGCGAG 


ACTCCATCTC 


TATTTAAAAA 


AGAAAGAAAT 


TCAAAGAGAA 


113580 




AAAAAGTATA 


CTTGTTTTTT 


TGTATCATCC 


ATATTTTACC 


TTTCTTTTTT 


TTGCCCCTTT 


113640 


15 


TTCTTTCCTG 


TGAATTTGAG 


TTACTGTCTA 


GTGTCATTTC 


CTTTTAGTCT 


GAAGAACTTC 


113700 




ATTTAGAATT 




TTTGAGACAA 


AGTCTCACTG 


TGTTGCCCAG 


GCTGGAGTGC 


113760 




AATGGTGCAG 


TCTCAGATCA. 


CTGCAACCTC 


TGCCTCCCTG 


GTTAGAGTGA 


TTTTCCTGCC 


113820 




TCAGCCTCCC 


AAGTAGCTGA 


GACTGCAGGC 


ACCTGCCACC 


ACCCCCAGCC 


AATTTTTTTG 


113880 




GTATTTTTAG 


TAGAGACAGG 


GTTTCACTAT 


GTTGGCCAGG 


CTGGTCTCGA 


ATTCATGACC 


113940 


20 


TCATGATCTG 


CCTGTCCTGG 


CCTCCCAAAA 


TGCTGGGATT 


ACCATGAGCC 


ACCACGCCCA 


114000 




GCCCATTTAG 


AATTTCTTTT 


TTTTTTTTTT 


TTTTGAGATG 


GGGTCTCGCT 


CTTGTTTCCC 


114060 




AGGCTGGAGT 


GCAGTGGCAC 


GATCTCGGCT 


CACTGCGAGC 


TCCGCCTCCC 


GGGTTCACGC 


114120 




CATTCTCCTG 


CCTCAGCCTC 


CCGAGTAGCT 


GGGATTACAG 


GCGCCTGCCA 


CCACGCCCAC 


114180 




CTAATTTTTT 


GTATTTTTAG 


GAGAGATGGG 


GTTTCACCAT 


GTTAGCCAGG 


ATGGTCTTGA 


114240 


25 


TCTCCTGACC 


TCGTGATCCG 


CCCGCCTTGG 


CCTCCCAAAG 


TGCTGGGATT 


ACAGGCGTGA 


114300 




GCCACCGCGC 


CCGGCTAGAA 


TTTCTTGTAG 


GACAGGCTTG 


CTAGCAACCA 


ATTCAGTGTT 


114360 




TATTTGGGAA 


TGTCTTTATT 


TCAGCTTCAT 


TTTTTGAAGG 


ATAGTTTAGC 


TGGCTATAGA 


114420 




ATTATTAATT 


GATCATTCrr 


TTCAGTGTTT 


AAAAGTGTCA 


TCATGCTACC 


TTCTGGGTTC 


114480 




CATTGTTTCT 


GATGAGAAGT 


CATCTGTCAA 


ATTGTCCCTT 


TGTACTTGAA 


GAATTATCTT 


114540 


30 


TTTTTCTCTT 


GATGTTTTCA 


AGATTTTCTC 


TTTGTCTTTG 


GCCTTTAGTA 


GTTTGTGATG 


114500 
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TATCTAGGTG TGGATCTCTT GGTGTGCATC GTATTTGGGC TTCAGTAAGC CTCTTAGATT 114660 
CATAGATTAA TGTTTTGTTT TGTTTTACCA AATTTGGAGA GTTTTTACTC ATCATTTCAA 114720 
CAAATTTTTT TCCTGCCCCT CTCTCATCTC CTTTTGGGAG TACCACTGCA TGTATGTTGG 114780 
TGTGCGTTCT CTA (SEQ ID N0:3) 114793. 

5 The present invention also relates to a portion of SEQ ID N0:3 which comprises 5' 

regulatory regions, exons, introns and 3' non-translated regions which comprise the human 
NHL gene of the present invention. Such regulatory sequence may be found within the 
various regions of this 1 15 kb fragment. The 5' portion of SEQ ED NO: 1 begins at nucleotide 
47095 of SEQ TD N0:3, the initiating ATG of human NHL is from nucleotide 48687-48689 
iO of SEQ E) N0:3, the termination TAG' codon is from nucleotide 84855-84857, while the 3' 
terminus of SEQ ID NO: 1 as disclosed herein (GC AGTGCCC) corresponds to nucleotides 
85308-85316. To this end, one preferred aspect of the invention is an isolated genomic 
fragment or fragments which comprise from about nucleotide 470000 to about nucleotide 
85500 of SEQ ID N0:3), which comprises the portion of the genomic clone encoding the 
15 mRNA transcript responsible for human NHL (see Figure 5 A-B). The genomic sequence 
encoding NHL contains 35 exons (Figure 5 A). An especially preferred aspect of the 
invention is a human genomic fragment or fragments which comprise from about nucleotide 
47095 to about nucleotide 85316 of SEQ ID N0:3. As noted in regard to SEQ ID N0:1, the 
present invention also relates to DN A vectors and recombinant hosts which comprise at least 
20 a portion of SEQ ID N0:3. Portions of the 1 15 kb genomic fragment may be housed in 

multiple vector/hosts so as to optimize handling of the DNA sequences within SEQ ID NO:3. 
Therefore, the present invention relates to the isolated genomic sequence which set forth as 
SEQ ID NO:3, a region of SEQ ID N0:3 which contains the coding and non-coding region ol 
human NHL, as well as cf^-acting sequences within SEQ ID NO :3 which effect regulation of 
25 transcription of one or more of the genes localized within this 1 15 kb human genomic 

fragment, including regulatory regions effecting levels of NHL, M68/DcR3, SGUP and ARP. 
As noted above, this region of chromosome 20 (20ql3.3) is associated with tumor growth. 
Therefore, an aspect of this invention also comprises, as one example, the use of one or more 
regulatory regions of this 1 15 kb genomic sequence as a target to antagonize the effect of a 
30 transcriptional factor(s) which normally upregulate expression of a gene which has a caustic 
role in tumor growth. Alternatively, compounds may be selected which interacts with a 
specific cz5-acting sequence to upregulate a gene within this region, where upregulation 
results in a decrease in tumor growth. 

The present invention is also directed to methods of screening for compounds 
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which modulate the expression of DNA or RNA encoding a NHL protein. 
Compounds which modulate these activities may be DNA, RNA, peptides, proteins, 
or non-proteinaceous organic molecules. Compounds may modulate by increasing or 
attenuating the expression of DNA or RNA encoding NHL, or the function of the 

5 NHL-based protein. Compounds that modulate the expression of DNA or RNA 
encoding NHL or the biological function thereof may be detected by a variety of 
assays. The assay may be a simple "yes/no" assay to determine whether there is a 
change in expression or function. The assay may be made quantitative by comparing 
the expression or function of a test sample with the levels of expression or function in 

10 a standard sample. Kits containing NHL, antibodies to NHL, or modified NHL may 
be prepared by known methods for such uses. 

The DNA molecules, RNA molecules, recombinant protein and antibodies of 
the present invention may be used to screen and measure levels of NHL. The 
recombinant proteins, DNA molecules, RNA molecules and antibodies lend 

15 themselves to the formulation of kits suitable for the detection and typing of NHL. 
Such a kit would comprise a compartmentalized carrier suitable to hold in close 
confinement at least one container. The carrier would further comprise reagents such 
as recombinant NHL or anti-NHL antibodies suitable for detecting NHL. The carrier 
may also contain a means for detection such as labeled antigen or enzyme substrates 

20 or the like. 

The assays described above can be carried out with cells that have been 
transiently or stably transfected with NHL. The expression vector may be introduced 
into host cells via any one of a number of techniques including but not limited to 
transformation, transfection, protoplast fusion, and electroporation. Transfection is 

25 meant to include any method known in the art for introducing NHL into the test cells. 
For example, transfection includes calcium phosphate or calcium chloride mediated 
transfection, lipofection, infection with a retroviral construct containing NHL, and 
electroporation. The expression vector-containing cells are individually analyzed to 
determine whether they produce NHL protein. Identification of NHL expressing cells 

30 may be done by several means, including but not limited to immunological reactivity 
with anti-NHL antibodies, labeled ligand binding, the presence of host cell-associated 
NHL activity. 

The specificity of binding of compounds showing affinity for NHL is shown 
by measuring the affinity of the compounds for recombinant cells expressing NHL. 
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Expression of human NHL and screening for compounds that bind to NHL or that 
inhibit the binding of a known, radiolabeled Hgand of NHL provides an effective 
method for the rapid selection of compounds with high affmity for NHL. Such 
ligands need not necessarily be radiolabeled but can also be nonisotopic compounds 

5 that can be used to displace bound radiolabeled compounds or that can be used as 
activators in functional assays. Compounds identified by the above method are likely 
to be agonists or antagonists of NHL and may be peptides, proteins, or non- 
proteinaceous organic molecules. 

Accordingly, the present invention is directed to methods for screening for 

10 compounds which modulate the expression of DNA or RNA encoding a NHL protein 
as well as compounds which effect the function of the NHL protein. Methods for 
identifying agonists and antagonists of other receptors are well known in the art and 
can be adapted to identify agonists and antagonists of NHL. For example, 
Cascieri et al (1992, Molec. Pharmacol, 41:1096-1099) describe a method for 

15 identifying substances that inhibit agonist binding to rat neurokinin receptors and thus 
are potential agonists or antagonists of neurokinin receptors. The method involves 
transfecting COS cells with expression vectors containing rat neurokinin receptors, 
allowing the transfected cells to grow for a time sufficient to allow the neurokinin 
receptors to be expressed, harvesting the transfected cells and resuspending the cells 

20 in assay buffer containing a known radioactively labeled agonist of the neurokinin 
receptors either in the presence or the absence of the substance, and then measuring 
the binding of the radioactively labeled known agonist of the neurokinin receptor to 
the neurokinin receptor. If the amount of binding of the known agonist is less in the 
presence of the substance than in the absence of the substance, then the substance is a 

25 potential agonist or antagonist of the neurokinin receptor. Where binding of the 
substance such as an agonist or antagonist to is measured, such binding can be 
measured by employing a labeled substance or agonist. The substance or agonist can 
be labeled in any convenient manner known to the art, e.g., radioactively. 
fluorescendy, enzymatically. 

30 Therefore, the present invention includes assays by which modulators of NHL 

are identified. As noted above, methods for identifying agonists and antagonists are 
known in the art and can be adapted to identify compounds which effect in vivo levels 
of NHL. Accordingly, the present invention includes a method for determining 
whether a substance is a potendal modulator pf mammalian NHL levels that 
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comprises: 

(a) providing test cells by transfecting cells with an expression 
vector that directs the expression of NHL in the cells; 

(b) exposing the test cells to the substance; 

5 (c) measuring the amount of binding of the substance to NHL; 

(d) comparing the amount of binding of the substance to NHL in 
the test cells with the amount of binding of the substance to control cells that have not 
been transfected with NHL or a portion thereof; wherein if the amount of binding of 
the substance is greater in the test cells as compared to the control cells, the substance 

10 is capable of binding to NHL. 

The conditions under which step (b) of the method is practiced are conditions 
that are typically used in the art for the study of protein-ligand interactions: e.g., 
physiological pH; salt conditions such as those represented by such commonly used 
buffers as PBS or in tissue culture media; a temperature of about 4°C to about 

15 The assays described above can be carried out with cells that have been 

transiently or stably transfected with NHL. Transfection is meant to include any 
method known in the art for introducing NHL into the test cells. For example, 
transfection includes calcium phosphate or calcium chloride mediated transfection, 
lipofection, infection with a retroviral construct containing NHL, and electroporation. 

20 Where binding of the substance or agonist to NHL is measured, such binding 

can be measured by employing a labeled substance or agonist. The substance or 
agonist can be labeled in any convenient manner known to the art, e.g., radioactive! y, 
fluorescently, enzymatically. 

Therefore, the specificity of binding of compounds having affinity for NHL 

25 shown by measuring the affinity of the compounds for recombinant cells expressing 
the cloned receptor or for membranes from these cells. Expression of the cloned 
receptor and screening for compounds that bind to NHL or that inhibit the binding of a 
known, radiolabeled ligand of NHL to these cells provides an effective method for the 
rapid selection of compounds with high affinity for NHL. Such ligands need not 

30 necessarily be radiolabeled but can also be nonisotopic compounds that can be used to 
displace bound radiolabeled compounds or that can be used as activators in functional 
assays. It is also possible to construct assays wherein compounds are tested for an 
ability to modulate helicase activity in an in vitro- or in vivo- based assay. 
Compounds identified by the above method again are likely to be agonists or 
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antagonists of NHL and may be peptides, proteins, or non-proteinaceous organic 
molecules. As noted elsewhere in this specification, compounds may modulate by 
increasing or attenuating the expression of DNA or RNA encoding NHL, or by acting 
as an agonist or antagonist of the NHL receptor protein. Again, these compounds that 

5 modulate the expression of DNA or RNA encoding NHL or the biological function 
thereof may be detected by a variety of assays. The assay may be a simple "yes/no" 
assay to determine whether there is a change in expression or function. The assay may 
be made quantitative by comparing the expression or function of a test sample with 
the levels of expression or function in a standard sample. 

10 Expression of NHL DNA may also be performed using in vitro produced 

synthetic mRNA. Synthetic mRNA can be efficiently translated in various cell-free 
systems, including but not limited to wheat germ extracts and reticulocyte extracts, as 
well as efficiently translated in cell based systems, including but not limited to 
microinjection into frog oocytes, with microinjection into frog oocytes being 

15 preferred. 

Following expression of NHL in a host cell, NHL protein may be recovered to 
provide NHL protein in active fonm. Several NHL protein purification procedures are 
available and suitable for use. Recombinant NHL protein may be purified from cell 
lysates and extracts by various combinations of, or individual application of salt 

20 fractionation, ion exchange chromatography, size exclusion chromatography, 
hydroxylapatite adsorption chromatography and hydrophobic interaction 
chromatography. In addition, recombinant NHL protein can be separated from other 
cellular proteins by use of an immunoaffinity column made with monoclonal or 
polyclonal antibodies specific for full-length NHL protein, or polypeptide fragments 

25 of NHL protein. 

Polyclonal or monoclonal antibodies may be raised against NHL or a synthetic 
peptide (usually from about 9 to about 25 amino acids in length) from a portion of 
NHL disclosed in SEQ ID N0:2. Monospecific antibodies to NHL. are purified from 
mammalian antisera containing antibodies reactive against NHL or are prepared as 

30 monoclonal antibodies reactive with NHL using the technique of Kohler and Milstein 
(1975, Nature 256: 495^97). Monospecific antibody as used herein is defined as a 
single antibody species or multiple antibody species with homogenous binding 
characteristics for NHL. Homogenous binding as used herein refers to the ability of 
the antibody species to bind to a specific antigen or epitope, such as those associated 
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with NHL, as described above. Human NHL-specific antibodies are raised by 
immunizing animals such as mice, rats, guinea pigs, rabbits, goats, horses and the 
like, with an appropriate concentration of NHL protein or a synthetic peptide 
generated from a portion of NHL with or without an immune adjuvant. 

5 Preimmune serum is collected prior to the first immunization. Each animal 

receives between about 0.1 mg and about 1000 mg of NHL protein associated with an 
acceptable immune adjuvant. Such acceptable adjuvants include, but are not limited 
to, Freund's complete, Freund's incomplete, alum-precipitate, water in oil emulsion 
containing Corynebacterium parvum and tRNA. The initial immunization consists of 

10 NHL protein or peptide fragment thereof in, preferably, Freund's complete adjuvant at 
multiple sites either subcutaneously (SC), intraperitoneally (IP) or both. Each animal 
is bled at regular intervals, preferably weekly, to determine antibody titer. The 
animals may or may not receive booster injections following the initial immunization. 
Those animals receiving booster injections are generally given an equal amount of 

15 NHL in Freund's incomplete adjuvant by the same route. Booster injections are given 
at about three week intervals until maximal titers are obtained. At about 7 days after 
each booster immunization or about weekly after a single immunization, the animals 
are bled, the serum collected, and aliquots are stored at about -20''C. 

Monoclonal antibodies (mAb) reactive with NHL are prepared by immunizing 

20 inbred mice, preferably Balb/c, with NHL protein. The mice are immunized by the IP 
or SC route with about 1 mg to about 100 mg, preferably about 10 mg, of NHL 
protein in about 0.5 ml buffer or saline incorporated in an equal volume of an 
acceptable adjuvant, as discussed above. Freund's complete adjuvant is preferred. 
The mice receive an initial immunization on day 0 and are rested for about 3 to about 

25 30 weeks. Immunized mice are given one or more booster immunizations of about 1 
to about 100 mg of NHL in a buffer solution such as phosphate buffered saline by the 
intravenous (IV) route. Lymphocytes, from antibody positive mice, preferably splenic 
lymphocytes, are obtained by removing spleens from immunized mice by standard 
procedures known in the art. Hybridoma cells are produced by mixing the splenic 

30 lymphocytes with an appropriate fusion partner, preferably myeloma cells, under 

conditions which will allow the formation of stable hybridomas. Fusion partners may 
include, but are not limited to: mouse myelomas P3/NSl/Ag 4-1; MPC-11; S-194 and 
Sp 2/0, with Sp 2/0 being preferred. The antibody producing cells and myeloma cells 
are fused in polyethylene glycol, about 1000 mol. wt., at concentrations from about 
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30% to about 50%. Fused hybridoma cells are selected by growth in hypoxanthine, 
thymidine and aminopterin supplemented Dulbecco*s Modified Eagles Medium 
(DMEM) by procedures known in the art. Supernatant fluids are collected form 
growth positive wells on about days 14, 18, and 21 and are screened for antibody 

5 production by an immunoassay such as solid phase immunoradioassay (SPERA) using 
NHL as die antigen. The culture fluids are also tested in the Ouchterlony precipitation 
assay to determine the isotype of the mAb. Hybridoma cells from antibody positive 
wells are cloned by a technique such as the soft agar technique of MacPherson, 1973, 
Soft Agar Techniques, in Tis:iue Culture Methods and Applications, Kruse and 

10 Paterson, Eds., Academic Press. 

Monoclonal antibodies are produced in vivo by injection of pristine primed 
Balb/c mice, approximately 0.5 ml per mouse, with about 2 x 10^ to about 6 x 10^ 
hybridoma cells about 4 days after priming. Ascites fluid is collected at 
approximately 8-12 days after cell transfer and the monoclonal antibodies are purified 

15 by techniques known in the art. 

In vitro production of anti- NHL mAb is carried out by growing the hybridoma 
in DMEM containing about 2% fetal calf serum to obtain sufficient quantities of the 
specific mAb. The mAb are purified by techniques known in the art. 

Antibody titers of ascites or hybridoma culture fluids are determined by 

20 various serological or immunological assays which include, but are not limited to, 
precipitation, passive agglutination, enzyme-linked immunosorbent antibody (ELISA) 
technique and radioimmunoassay (RIA) techniques. Similar assays are used to detect 
the presence of NHL in body fluids or tissue and cell extracts. 

It is readily apparent to those skilled in the art that the above described 

25 methods for producing monospecific antibodies may be utilized to produce antibodies 
specific for NHL peptide fragments, or a respective fuU-length NHL. 

NHL antibody affinity columns are made, for example, by adding the 
antibodies to Affigel-10 (Biorad), a gel support which is pre-activated with N- 
hydroxysuccinimide esters such that the antibodies form covalent linkages with the 

30 agarose gel bead support. The antibodies are then coupled to the gel via amide bonds 
with the spacer arm. The remaining activated esters are then quenched with IM 
ethanolamine HCl (pH 8). The column is washed with water followed by 0.23 M 
glycine HCl (pH 2.6) to remove any non-conjugated antibody or extraneous protein. 
The column is then equilibrated in phosphate buffered saline (pH 7.3) and the cell 



-84- 



wo 01/42434 



PCT/USOO/33065 



culture supematants or cell extracts containing full-length NHL or NHL protein 
fragments are slowly passed through the column. The column is then washed with 
phosphate buffered saline until the optical density (A280) falls to background, then the 
protein is eluted with 0.23 M glycine-HCl (pH 2.6). The purified NHL protein is then 
5 dialyzed against phosphate buffered saline. 

Pharmaceutically useful compositions comprising modulators of NHL may be 
formulated according to known methods such as by the admixture of a 
pharmaceutically acceptable carrier. Examples of such carriers and methods of 
formulation may be found in Remington's Pharmaceutical Sciences. To form a 
10 pharmaceutically acceptable composition suitable for effective administration, such 
compositions will contain an effective amount of the protein, DNA, RNA, modified 
NHL, or either NHL agonists or antagonists including tyrosine kinase activators or 
inhibitors. 

Therapeutic or diagnostic compositions of the invention are administered to an 
15 individual in amounts sufficient to treat or diagnose disorders. The effective amount 

may vary according to a variety of factors such as the individual's condition, weight, 

sex and age. Other factors include the mode of administration. 

The pharmaceutical compositions may be provided to the individual by a 

variety of routes such as subcutaneous, topical, oral and intramuscular. 
20 The term "chemical derivative" describes a molecule that contains additional 

chemical moieties which are not normally a part of the base molecule. Such moieties 

may improve the solubility, half-life, absorption, etc. of the base molecule. 

Alternatively the moieties may attenuate undesirable side effects of the base molecule 

or decrease the toxicity of the base molecule. Examples of such moieties are 
25 described in a variety of texts, such as Remington's Pharmaceutical Sciences. 

Compounds identified according to the methods disclosed herein may be used 

alone at appropriate dosages. Alternatively, co-administratipn or sequential 

administration of other agents may be desirable. 

The present invention also has the objective of providing suitable topical, oral, 
30 systemic and parenteral pharmaceutical formulations for use in the novel methods of 

treatment of the present invention. The compositions containing compounds 

identified according to this invention as the active ingredient can be administered in a 

wide variety of therapeutic dosage forms in conventional vehicles for administration. 

For example, the compounds can be administered in such oral dosage forms as tablets, 
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capsules (each including timed release and sustained release formulations), pills, 
powders, granules, elixirs, tinctures, solutions, suspensions, syrups and emulsions, or 
by injection. Likewise, they may also be administered in intravenous (both bolus and 
infusion), intraperitoneal, subcutaneous, topical with or without occlusion, or 
5 intramuscular form, all using forms well known to those of ordinary skill in the 
pharmaceutical arts. 

Advantageously, compounds of the present invention may be administered in a 
single daily dose, or the total daily dosage may be administered in divided doses of 
two, three or four times daily. Furthermore, compounds for the present invention can 

10 be administered in intranasal form via topical use of suitable intranasal vehicles, or 
via transdermal routes, using those forms of transdermal skin patches well known to 
those of ordinary skill in that art. To be administered in the form of a transdermal 
delivery system, the dosage administration will, of course, be continuous rather than 
intermittent throughout the dosage regimen. 

15 For combination treatment with more than one active agent, where the active 

agents are in separate dosage formulations, the active agents can be administered 
concurrently, or they each can be administered at separately staggered times. 

The dosage regimen utilizing the compounds of the present invention is 
selected in accordance with a variety of factors including type, species, age, weight, 

20 sex and medical condition of the patient; the severity of the condition to be treated; 
the route of administration; the renal, hepatic and cardiovascular function of the 
patient; and the particular compound thereof employed. A physician or veterinarian 
of ordinary skill can readily determine and prescribe the effective amount of the drug 
required to prevent, counter or arrest the progress of the condition. Optimal precision 

25 in achieving concentrations of drug within the range that yields efficacy without 
toxicity requires a regimen based on the kinetics of the drug's availability to- target 
sites. This involves a consideration of the distribution, equilibrium, and elimination 
of a drug. 

The present invention also relates to a non-human transgenic animal which is 
30 useful for studying the ability of a variety of compounds to act as modulators of NHL, 
or any alternative functional NHL in vivo by providing cells for culture, in vitro. In 
reference to the transgenic animals of this invention, reference is made to transgenes 
and genes. As used herein, a transgene is a genetic construct including a gene. The 
transgene is integrated into one or more chromosomes in the cells in an animal by 
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methods known in the art. Once integrated, the transgene is carried in at least one 
place in the chromosomes of a transgenic animal. Of course, a gene is a nucleotide 
sequence that encodes a protein, such as one or a combination of the cDNA clones 
described herein. The gene and/or transgene may also include genetic regulatory 

5 elements and/or structural elements known in the art. A type of target cell for 

transgene introduction is the embryonic stem cell (ES). ES cells can be obtained from 
pre-implantation embryos cultured in vitro and fused with embryos (Evans et al., 
1981, Nature 292:154-156; Bradley et aL, 1984, Nature 309:255-258; Gossler et al., 
1986, Proc. Natl Acad, Sci. USA 83:9065-9069; and Robertson et a!., \9m Nature 

10 322:445-448). Transgenes can be efficiently introduced into the ES cells by a variety 
of standard techniques such as DNA transfection, microinjection, or by relfovirus- 
mediated transduction. The resultant transformed ES cells can thereafter be combined 
with blastocysts from a non-human animal. The introduced ES cells thereafter 
colonize the embryo and contribute to the germ line of the resulting chimeric animal 

15 (Jaenisch, 1988, Science 240: 1468-1474). It will also be within the purview of the 
skilled artisan to produce transgenic or knock-out invertebrate animals 
(e.g., C elegans) which express the NHL transgene in a wild type background as well 
in C. elegans mutants knocked out for one or both of the NHL subunits. These 
organisms will be helpful in further determining the dominant negative effect of NHL 

20 as well as selecting from compounds which modulate this effect. 

The present invention also relates to a non-human transgenic animal which is 
heterozygous for a functional NHL gene native to that animal. As used herein, 
functional is used to describe a gene or protein that, when present in a cell or in vitro 
system, performs normally as if in a native or unaltered condition or environment. 

25 The animal of this aspect of the invention is useful for the study of the retinal specific 
expression or activity of NHL in an animal having only one functional copy of the 
gene. The animal is also useful for studying the ability of a variety of compounds to 
act as modulators of NHL activity or expression in vivo or, by providing cells for 
culture, in vitro. It is reiterated that as used herein, a modulator is a compound that 

30 causes a change in the expression or activity of NHL, or causes a change in the effect 
of the interaction of NHL with its ligand(s), or other protein(s). In an embodiment of 
this aspect, the animal is used in a method for the preparation of a further animal 
which lacks a functional native NHL gene. In another embodiment, the animal of this 
aspect is used in a method to prepare an animal which expresses a non-native NHL 
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gene in the absence of the expression of a native NHL gene. In particular 
embodiments the non-human animal is a mouse. In further embodiments the non- 
native NHL is a wild-type human NHL which is disclosed herein, or any other 
biologically equivalent form of human NHL gene as also disclosed herein. 

5 In reference to the transgenic animals of this invention, reference is made to 

transgenes and genes. As used herein, a transgene is a genetic construct including a 
gene. The transgene is integrated into one or more chromosomes in the cells in an 
animal by methods known in the art. Once integrated, the transgene is carried in at 
least one place in the chromosomes of a transgenic animal. Of course, a gene is a 

10 nucleotide sequence that encodes a protein, such as human or mouse NHL. The gene 
and/or transgene may also include genetic regulatory elements and/or structural 
elements known in the art. 

Another aspect of the invention is a non-human animal embryo deficient for 
native NHL expression, This embryo is useful in studying the effects of the lack of 

15 NHL on the developing animal. In particular embodiments the animal is a mouse. 
The animal embryo is also useful as a source of cells lacking a functional native NHL 
gene. The cells are useful in in vitro culture studies in the absence of NHL. 

An aspect of this invention is a method to obtain an animal in which the cells 
lack a functional gene NHL native to the animal. The method includes providing a 

20 gene for an altered form of the NHL gene native to the animal in the form of a 

transgene and targeting the transgene into a chromosome of the animal at the place of 
the native NHL gene. The transgene can be introduced into the embryonic stem cells 
by a variety of methods known in the art, including electroporation, microinjection, 
and lipofection. Cells carrying the transgene can then be injected into blastocysts 

25 which are then implanted into pseudopregnant animals. In alternate embodiments, the 
transgene-targeted embryonic stem cells can be coincubated with fertilized eggs or 
morulae followed by implantation into females. After gestation, the animals obtained 
are chimeric founder transgenic animals. The founder animals can be used in further 
embodiments to cross with wild-type animals to produce Fl animals heterozygous for 

30 the altered NHL gene. In further embodiments, these heterozygous animals can be 
interbred to obtain the non-viable transgenic embryos whose somatic and germ cells 
are homozygous for the altered NHL gene and thereby lack a functional NHL gene. In 
other embodiments, the heterozygous animals can be used to produce cells lines. In 
preferred embodiments, the animals are mice. 
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A further aspect of the present invention is a transgenic non-human animal 
which expresses a non-native NHL on a native NHL null background. In particular 
embodiments, the null background is generated by producing an animal with an 
altered native NHL gene that is non-functional, i.e. a knockout. The animal can be 

5 heterozygous {i.e. , having a different allelic representation of a gene on each of a pair 
of chromosomes of a diploid genome) or homozygous (i.e. , having the same 
representation of a gene on each of a pair of chromosomes of a diploid genome) for 
the altered NHL gene and can be hemizygous {i.e., having a gene represented on only 
one of a pair of chromosomes of a diploid genome) or homozygous for the non-native 

10 NHL gene. In preferred embodiments, the animal is a mouse. In particular 

embodiments the non-native NHL gene can be a wild-type or mutant allele including 
those mutant alleles associated with a disease. In further embodiments, the non-native 
NHL is a human NHL. In a further embodiment the non-native NHL gene is operably 
linked to a promoter. As used herein, operably linked is used to denote a functional 

15 connection between two elements whose orientation relevant to one another can vary. 
In this particular case, it is understood in the art that a promoter can be operably 
linked to the coding sequence of a gene to direct the expression of the coding 
sequence while placed at various distances from the coding sequence in a genetic 
construct. 

20 An aspect of this invention is a method of producing transgenic animals 

having a transgene including a non-native NHL gene on a native NHL null 
background. The method includes providing transgenic animals of this invention 
whose cells are heterozygous for a native gene encoding a functional NHL protein and 
an altered native NHL gene. These animals are crossed with transgenic animals of 

25 this invention that are hemizygous for a transgene including a non-native NHL gene to 
obtain animals that are both heterozygous for an altered native NHL gene and 
hemizygous for a non-native NHL gene. The latter animals are interbred to obtain 
animals that are homozygous or hemizygous for the non-native NHL and are 
homozygous for the altered native NHL gene. In particular embodiments, cell lines 

30 are produced from any of the animals produced in the steps of the method. 

The transgenic animals and cells of this invention are useful in the 
determination of the in vivo function of anon-native NHL in the central nervous 
. system and in other tissues of an animal. The animals are also useful in studying the 
tissue and temporal specific expression patterns of a non-native NHL throughout the 
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animals. The animals are also useful in determining the ability for various forms of 
wild-type and mutant alleles of a non-native NHL to rescue the native NHL null 
deficiency. The animals are also useful for identifying and studying the ability of a 
variety of compounds to act as modulators of the expression or activity of a non-native 

5 NHL in vivo, or by providing cells for culture, for in vitro studies. 

As used herein, a "targeted gene" or "Knockout" (KO) is a DNA sequence 
introduced into the germline of a non-human animal byway of human intervention, 
including but not limited to, the methods described herein. The targeted genes of the 
invention include nucleic acid sequences which are designed to specifically alter 

10 cognate endogenous alleles. An altered NHL gene should not fully encode the same 
NHL as native to the host animal, and its expression product can be altered to a minor 
or great degree, or absent altogether. In cases where it is useful to express a non- 
native NHL gene in a transgenic animal in the absence of a native NHL gene we 
prefer that the altered NHL gene induce a null lethal knockout phenotype in the 

15 animal. However a more modestly modified NHL gene can also be useful and is 
within the scope of the present invention. 

A type of target cell for transgene introduction is the embryonic stem cell (ES). 
ES cells can be obtained from pre-implantation embryos cultured in vitro and fused 
with embryos (Evans et al., 1981, Nature 292:154-156; Bradley et al., 1984. Nature 

20 309:255-258; Gossler et ah, 1986, Proc. Natl. Acad Sci. USA 83:9065-9069; and 

Robertson et al., 1986 Nature 322:445-448). Transgenes can be efficiently introduced 
into the ES cells by a variety of standard techniques such as DNA transfection, 
microinjection, or by retro virus-mediated transduction. The resultant transformed ES 
cells can thereafter be combined with blastocysts from a non-human animal. The 

25 introduced ES cells thereafter colonize the embryo and contribute to the germ line of 
the resulting chimeric animal (Jaenisch, 1988, Science 240: 1468-1474). 

The methods for evaluating the targeted recombination events as well as the 
resulting knockout mice are readily available and known in the art. Such methods 
include, but are not limited to DNA (Southern) hybridization to detect the targeted 

30 allele, polymerase chain reaction (PGR), polyacrylamide gel electrophoresis (PAGE) 
and Western blots to detect DNA, RNA and protein. 

The following examples are provided to illustrate the present invention 
without, however, limiting the same hereto. 
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EXAMPLE 1 
Characterization of DNA Molecules Encoding NHL 
M68/DcR3 identification - The human osteoprotegerin (OPG) sequence (Acc. 

5 # U94332), which is a member of the TNFR-related family, was used to searched 
Genbank using the programs TBLASTN and TFASTX3 to identify novel gene family 
members. Two EST sequences (GenBank Acc. # AA155701 and AA025672) were 
identified that showed sequence similarities to the cysteine repeats of the OPG 
sequence. These EST sequences were then used to identify additional EST sequences, 

10 which formed a single EST cluster (GenBank Acc. #s aa577603, aa603704, aa613366, 
aal58406, w67560, aa325843, aal55646, aa025673, aa514270, m91489). Two clones 
were further characterized, which were derived from colon tumor and germ cell tumor 
libraries (Research Genetics, Inc). DNA sequence analysis revealed two alternatively 
spliced forms of the 5'-end UTR of M68/DcR3. The M68/DcR3 open reading frame 

15 was confirmed by sequence analysis of clones obtained by PGR cloning from a normal 
human cDNA library (Clontech). 

M68/DcR3 BAC identification and sequencing - To further delineate the gene 
structure of M68/DcR3, genomic DNA was obtained using a human "Down to the 
Well" ^ genomic bacterial artificial chromosome (BAC) library (Genome Systems, 

20 Inc.) according to the manufacturer's protocol. Two sets of PGR primers, C68.36F: 
5'-CACAGGTTCAGCATGTTTGTGCGTC-3* (SEQ ED N0:4) and C68.275R: 
5'-CACAGTCCCTGCTGGCCTCTGTCTA-3' (SEQ ID N0:5), and E68.7I5F: 
5*-CAGGACATCTCCATCAAGAGGCTGC-3' (SEQ ID N0:6) and E68.972R: 
5*-AATAAGAGGGGGCCAGGATCAGTGC-3' (SEQ ID N0:7), were used to carry 

25 out PGR reactions to identify positive wells that contained the full-length M68/DcR3 
, gene. The PGR conditions used were 94°C for 9min, 35 cycles of (94°C, 30 sec, 68°C 
3 min.) followed by 72**C for 10 min. Two positive BAC clones were identified and 
characterized by restriction digestion and B AC-end sequence analyses, of which 
hbml68 was selected for shotgun sequencing. 

30 A shot-gun library for BAC hbml68 was constructed using a conventional 

strategy. Briefly, two 150-ml bacterial cultures were combined and purified using a 
modified protocol of the plasmid-Maxi kit (QIAGEN) followed by CsCl gradient 
purification. After butanol extraction and isopropanol precipitation, BAC DNA was 
nebulizied at 10 psi for 60 seconds to generate randomly sheared fragments. 
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Following ethane! precipitation, the fragments were end-repaired using T4 polymerase 
(Promega) and BstXI adaptors (Invitrogen) were ligated overnight. Removal of 
excess, unligated adaptors and size selection was performed using acDNA sizing 
column (Life Technologies, Inc.) to generate genomic fragments in the size range of 

5 1500 to 3000 bp. Adaptor ligated fragments were cloned into a modified pBlueScript 
STC vector (Stratagene) and transformed in XL2-Blue ultracompentent cells 
(Stratagene). Approximately 1000 clones were isolated, plasmids were purified using 
the Turbo miniprep kits (QIAGEN), and both plasmid ends were sequenced with the 
BigDye terminator kits (Perkin-Elmer). Sequence data were assembled using 

10 Phred/Phrap/Consed where single-stranded and gap regions were closed using a 
directed sequencing strategy. 

NHL identification and sequencing - The genomic clone for the NHL gene 
was obtained and sequenced. The transcript was identified through exon prediction 
using GRAIL2 and sequence alignment to a contiguous 4.5 kilobase region of 

15 chromosome 4 (88% sequence identity). The complete exon structure of NHL was 
subsequendy confirmed by RT-PCR analysis. The exon structure was confirmed by 
RT-PCR using polyA RNA from a human colorectal adenocarcinoma cell line, 
SW480 (Clontech). Primers were designed based on the genomic sequence that were 
predicted to be exons. RT-PCR reaction were carried out with SW480 polyA RNA 

20 using standard conditions with TaqGoId Enzyme at 94°C for 12min, 35 cycles of 
(94*^0, 30 sec, 60C, 30 sec, and 68**C 2-6 min.) followed by 68*'C for 7 min. Most 
sequence confrimation was accomplished by RT-PCR, although first junction between 
exon 1 and 2 was confirmed by 5'RACE and junctions between exon 26-29 were by 
RCCA. The primers used were as follows: 



Junction of Exons 


Confirmed bv Primers 


H01/H02 


hdkw (5'RACE) 


H02/H03 


hdiy,hdiz 


H03-H09 


hdid,hdie,hdja,hdjb 


H09-H13 


hdja,hdie 


H13-H18 


hdje.hdjf 


H18-H23 


hdjg.hdjh 


H23-H26 


hdji.hdjj 


H26-H29 


hdkv,r543(RCCA) 


H29-H31 


hdij,hdmu,hdnd,hdne 
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H34/H35 



H32/H34 



H31/H32 



hdij.hdmu 

hdip,hdil,hdmv,hdik,hdli 
hdng.hdnh 



5 HDID - 5'-GTGAATCGCATCCTGGAGAG-3' (SEQ ID N0:8); 

HDIE - 5 -GTCTCCAGGCAGCTCAACAG-3' (SEQ ID N0:9); 

HDU 5'-ACCCTGTCCCTCCTGTCTGA-3' (SEQ ID NO:10); 

HDIY - 5 -AGACCCTAAGATGTTCGGAG-3' (SEQ ID NO: 1 1); 

HDIZ - 5'-GATGACCTGTGTGAGTTGCG-3' (SEQ ID NO: 12); 
10 HDJA - 5 '-CGCAACTCACACAGGTCATC-3' (SEQ ID N0:13); 

HDJB - 5 -GGAGTCAGGTCAAAGGATGC-3' (SEQ ID NO: 14); 

HDJC - 5 -GCATCCTTTGACCTGACTCC-3'(SEQ ID N0:15); 

HDJD - 5 '-GGTCTGAAACGTGATCTGGG-3 ' (SEQ ED NO: 1 6); 

HDJE - 5 -CCCAGATCACGTTTCAGACC-3' (SEQ ID NO: 17); 
15 HDJF - 5'-CGATGATGTGTGGGTTCTCC-3' (SEQ ID NO: 18); 

HDJG - 5-GGAGAACCCACACATCATCG-3'(SEQ ID NO: 19); 

HDJH - 5 -CGTGTCTGAGAAGTCCAGCC-3'(SEQ ID N0:20); 

HDJI - 5'-GGCTGGACTTCTCAGACACG-3'(SEQ ID N0:21); 

HDJJ - 5 -ACAGCATCTTCTCCACGCAC-3'(SEQ ID N0:22); 
20 HFMU - 5 -AGTCCTCTGGCTTTGCAGTG-3' (SEQ ID NO:23); 

HDKV - 5 '-TGTGCGTGGAGAAGATGCTG-3' (SEQ ID NO:24); 

HDKW - 5 -GGCTGGAAAGGGAAGTCTAC-3' (SEQ ID NO:25); 

HDND - 5'-TGGTTCAGGTGCTCTTGGGG-3'(SEQ ID NO:26); 

HDNE - 5 -CGTGAAGCAGGAGTTGAGCC-3' (SEQ ID NO:27); 
25 HDIK - 5 -ATCTTGCTCTGGGTCTT.CCC-3' (SEQ ID NO:28); 

HDIL - 5 -CACTGCAAAGCCAGAGGACT-3'(SEQ ID NO:29); 

HDIP - 5 -ATAAGCAAGACGACGACCTC-3' (SEQ ID NO:30); 

HDLI - 5 -CTATrCTGTTGGGTGGGTTC-3'(SEQ ID N0:31); 

HDMV - 5-CGTGCCTCCTGTGCTTACCC-3'(SEQ ID N0:32); 
30 HDNG - 5 -CAGACCCCAAGGTAGCTCAG-3' (SEQ ID NO:33); 

HDNH - 5'-GGAAGACCCAGAGCAAGATC-3' (SEQ ID NO:34). 
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Amplified product were subject to direct sequencing after purification from an agarose 
gel or cloned into a TOPO PGR cloning vector (Invitrogen) for sequencing. 
Multiple sequence alignment of NHL to known helicases showed that NHL contains 

5 all the seven critical helicase domains. BLAST analysis of the predicted 1,219 amino 
acid sequence (see Figure 2, SEQ ID N0:2) reveal an approximately 26% sequence 
identity and 48% sequence similarity to the RAD3/ERCC2 gene family of DNA 
helicases (see Figure 3). Review of this sequence data shows that two partial human 
cDNA clones (Acc. No, al080127 and ab029011) are deposited. No. al080i27 covers 

10 exon 25-35 while ab02901 1 covers exons 9-35, Ab029011 starts at amino acid 240 
of the full length human NHL protein disclosed herein, but also differs at exon 35 and 
appears to be a fusion transcript with M68. This cDNA was isolated from brain 
tissue, which has been known to express rare transcripts. 

15 EXAMPLE 2 

Northern Analysis of human NHL Expression 
Messenger RNA (mRNA) obtained from human brain, heart, skeletal muscle, 
colon, thymus, spleen, kidney, liver, small intestine, placenta, lung, and peripheral 
blood leukocytes. Two \ig of polyA"^ RNA were run on each lane a denaturing 
20 formaldehyde 1% agarose gel, and transferred to a charged-modified nylon membrane. 
The probe was made using a 733 bp fragment derived from 1 174-1907 nt of the NHL 

32 

cDNA. This fragment was labeled via the P dCTP random priming method 
(Ambion). Hybridization was carried in ExpressHyb (Clontech) according to the 
manufacturer's protocol except for the final wash, which was at 55*^C. Membranes 
25 were exposed to X-ray film with intensifying screen at -80**C overnight. The Northern 
data is presented in Figure 4. Note hybridization of the NHL probe to an 
approximately 4.4 kb transcript. The 7.5 kb transcript may suggest an alternative 
splicing of the NHL RNA. 
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EXAMPLE 3 
Chromosomal localization 
To map the position of M68/NHL in the human genome, primers C68.36F and 

5 C68.275R, were used to carry out PGR reactions to 93 clones of the MIT GeneBridge 
4 panel (Research Genetics) and results were submitted to MTT for analysis. 
M68/DcR3 was mapped to the extreme telomere of chromosome 20, at 20ql3.3, 28cR 
from D20S 173 with a lod score of 13. An analogous procedure was also carried out 
with the 83 clones of the Stanford G3 radiation hybrid panel, with PGR results 

10 submitted to the Stanford Genome Center for analysis. Analysis using,another pair of 
PGR primers specific to NHL yielded the same result. For fluorescence in situ (FISH) 
analysis, the normal human male fibroblast cell line, L136 (Coriell Cell Repository, 
Camden, NJ) was arrested in mitosis with colcemid (10 /ig/ml). A human 
chromosome 20 a-satellite probe (Vysis, Downers Grove, IL) was directly labeled 

15 with Spectrum- Orange dUTP and was used to identify chromosome 20. The M68 
BAG clone was directly labeled with SpectrumGreen dUTP by nick translation 
(Vysis). Slides were counterstained with DAPI stain and viewed under an Olympus 
microscope with narrow blue and DAPI/TRITC filters. Fifty metaphase cells were 
scored to verify that the M68 probe was located on the same chromosome as the 

20 Human Chromosome 20 probe. Radiation hybrid chromosomal mapping reconfirms 
that it is linked to M68 locus,at 20ql3.3. 
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WHAT IS CLAIMED IS: 

1. A purified DNA molecule encoding a mammalian NHL pirotein. 



2. A purified DNA molecule of claim I encoding a human NHL protein 
5 which comprises the amino acid sequence 





MPKIVLNGVT VDFPFQPYKC 


QQEYMTKVLE 


CLQQKVNGIL 


ESPTGTGKTL 


CLLCTTLAWR 




EHLRDGISAR KIAERAQGEL 


FPDRALSSWG 


NAAAAAGDPI 


ACYTDIPKII 


YASRTHSQLT 




QVINELRNTS YRPKVCVLGS 


REQLCIHPEV 


KKQESNHLQI 


HLCRKKVASR 


SCHFYNNVEE 




KSLEQELASP ILDIEDLVKS 


GSKHRVCPYY 


LSRNLKQQAD 


IIFMPYNYLL 


DAKSRRAHNI 


10 


DLKGTWIFD EAHNVEKMCE 


ESASFDLTPH 


DLASGLDVID 


QVLEEQTKAA 


QQGEPHPEFS 




-ADSPSPGLNM ELEDIAKLKM 


ILLRLEGAID 


AVELPGDDSG 


VTKPGSYIFE 


LFAEAQITFQ 




TKGCILDSLD QIIQHLAGRA GVFTNTAGLQ 


KLADIIQIVF 


SVDPSEGSPG 


SPAGLGALQS 




YKVHIHPDAG HRRTAQRSDA WSTTAARKRG 


ECVLSYWCFSP 


GHSMHELVRQ 


GVRSLILTSG 




TLAPVSSFAL EMQIPFPVCL 


ENPHIIDKHQ 


IWVGWPRGP 


DGAQLSSAFD 


RRFSEECLSS 


15 


LGKALGNIAR WPYGLLIFF 


PSYPVMEKSL 


EFWRARDLAR 


KMEALKPLFV 


EPRSKGSFSE 




TISAYYARVA APGSTGATFL 


AVCRGKASEG 


LDFSDTNGRG 


VIVTGLPYPP 


RMDPRWLKM 




QFLDEMKGQG GAGGQFLSGQ 


EWYRQQASRA 


VNQAIGRVIR 


HRQDYGAVFL 


CDHRFAFADA 




RAQLPSWVRP HVRVYDNFGH VIRDVAQFFR 


VAERTMPAPA 


PRATAPSVRG 


EDAVSEAKSP 




GPFFSTRKAK SLDLHVPSLK QRSSGSPAAG 


DPESSLCVEY 


EQEPVPARQR 


PRGLLAALEH 


20 


SEQRAGSPGE EQAHSCSTLS 


LLSEKRPAEE 


PRGGRKKIRL 


VSHPEEPVAG 


AQTDRAKLFM 




VAVKQELSQA NFATFTQALQ 


DYKGSDDFAA 


LAACLGPLFA 


EDPKKHNLLQ 


GFYQFVRPHH 




KQQFEEVCIQ LTGRGCGYRP 


EHSIPRRQRA 


QPVLDPTGRT 


APDPKLTVST 


AAAQQLDPQE 




HLNQGRPHLS PRPPPTGDPG 


SQPQWGSGVP 


RAGKQGQHAV 


SAYLADARRA 


LGSAGCSQLL 




AALTAYKQDD DLDKVL.AVLA 


ALTTAKPEDF 


PLLHRFSMFV 


RPHHKQRFSQ 


TCTDLTGRPY 


25 


PGMEPPGPQE ERLAVPPVLT 


HRAPQPGPSR 


SEKTGKTQSK 


ISSFLRQRPA GTVGAGGEDA 




GESQSSGPPH GPAASEWGL* 


(SEQ ID NO 


:2) - 







3, An expression vector for expressing a NHL protein in a recombinant 
host cell wherein said expression vector comprises a DNA molecule of claim 2. 

30 

4. A host cell which expresses a recombinant NHL protein wherein said 
host cell contains the expression vector of claim 3. 
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5. A process for expressing a NHL protein in a recombinant host cell, 
comprising: 

(a) transfecting the expression vector of claim 3 into a suitable host cell; and, 
5 (b) culturing the host cells of step (a) under conditions which allow 

expression of said NHL protein from said expression vector. 

6. A purified DNA molecule encoding a human NHL protein which 
consists of the amino acid sequence 

10 -MPKIVLNGVT VDFPFQPYKC QQEYMTKVLE CLQQKVNGIL ESPTGTGKTL CLLCTTLAWR 
EHLRDGISAR KIAERAQGEL FPDRALSSWG NAAAAAGDPI ACYTDIPKII YASRTHSQLT 
QVINELRNTS YRPKVCVLGS REQLCIHPEV KKQESNHLQI HLCRKKVASR SCHFYNNVEE 
KSLEQELASP ILDIEDLVKS GSKHRVCPYY LSRNLKQQAD IIFMPYNYLL DAKSRRAHNI 
DLKGTWIFD EAHNVEKMCE ESASFDLTPH DLASGLDVID QVLEEQTKAA QQGEPHPEFS 

15 ADSPSPGLNM ELEDIAKLKM ILLRLEGAID AVELPGDDSG VTKPGSYIFE LFAEAQITFQ 
TKGGILDSLD QIIQHLAGRA GVFTNTAGLQ KLADIIQIVF SVDPSEGSPG SPAGLGALQS 
YKVHIHPDAG HRRTAQRSDA WSTTAARKRG KVLSYWCFSP GHSMHELVRQ GVRSLILTSG 
TLAPVSSFAL EMQIPFPVCL ENPHIIDKHQ IWVGWPRGP DGAQLSSAFD RRFSEECLSS 
LGKALGNIAR WPYGLLIFF PSYPVMEKSL EFWRARDLAR KMEALKPLFV EPRSKGSFSE 

20 TISAYYARVA APGSTGATFL AVCRGKASEG LDFSDTNGRG VIVTGLPYPP RMDPRWLKM 
QFLDEMKGQG GAGGQFLSGQ EWYRQQASRA VNQAIGRVIR HRQDYGAVFL CDHRFAFADA 
RAQLPSWVRP HVRVYDNFGH VIRDVAQFFR VAERTMPAPA PRATAPSVRG EDAVSEAKSP 
GPFFSTRKAK SLDLHVPSLK QRSSGSPAAG DPESSLCVEY EQEPVPARQR PRGLLAALEH 
SEQRAGSPGE EQAHSCSTLS LLSEKRPAEE PRGGRKKIRL VSHPEEPVAG AQTDRAKLFM 

25 VAVKQELSQA NFATFTQALQ DYKGSDDFAA LAACLGPLFA EDPKKHNLLQ GFYQFVRPHH 
KQQFEEVCIQ LTGRGCGYRP EHSIPRRQRA QPVLDPTGRT APDPKLTVST AAAQQLDPQE 
HLNQGRPHLS PRPPPTGDPG SQPQWGSGVP RAGKQGQHAV SAYLADARRA LGSAGCSQLL 
AALTAYKQDD DLDKVLAVIA ALTTAKPEDF PLLHRFSMFV RPHHKQRFSQ TCTDLTGRPY 
PGMEPPGPQE ERLAVPPVLT HRAPQPGPSR SEKTGKTQSK ISSFLRQRPA GTVGAGGEDA 

30 GPSQSSGPPH GPAASEWGL* (SEQ ID N0:2). 

7. An expression vector for expressing a NHL protein in a recombinant 
host cell wherein said expression vector comprises a DNA molecule of claim 6. 
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8. A host cell which expresses a recombinant NHL protein wherein said 
host cell contains the expression vector of claim 7. 

9. A process for expressing a NHL protein in a recombinant host cell, 
5 comprising: 

(a) transfecting the expression vector of claim 7 into a suitable host cell; and, 

(b) culturing the host cells of step (a) under conditions which allow 
expression of said NHL protein from said expression vector. 

10 10. A purified DNA molecule which comprises the nucleotide sequence as 

set forth in SEQIDNO:!, 

11. An expression vector for expressing a NHL protein in a recombinant 
host ceil wherein said expression vector comprises a DNA molecule of claim 10. 

15 

12. A host cell which expresses a recombinant NHL protein wherein said 
host cell contains the expression vector of claim 11. 

13. A purified DNA molecule which consists of the nucleotide sequence as 
20 set forth in SEQ E) NO: 1 . 

14. An expression vector for expressing a NHL protein in a recombinant 
host cell wherein said expression vector comprises a DNA molecule of claim 13. 

25 15. A host cell which expresses a recombinant NHL protein wherein said 

host cell contains the expression vector of claim 14. 

16. A purified DNA molecule of claim 13 which consists of the nucleotide 
sequence from about nucleotide 828 to about nucleotide 4587, as set forth in SEQ ID 

30 N0:1. 

17. An expression vector for expressing a NHL protein in a recombinant 
host cell wherein said expression vector comprises a DNA molecule of claim 16. 
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18. A host cell which expresses a recombinant NHL protein wherein said 
host cell contains the expression vector of claim 17. 

19. A substantially purified NHL protein which comprises the amino acid 
5 sequence as set forth in SEQ K) N0:2. 

20. A substantially purified NHL protein which consists of the amino acid 
sequence as set forth in SEQ ID NO:2. 

10 21. A substantially purified NHL protein which comprises the amino acid 

sequence as set forth in SEQ JD N0:2, wherein said protein is a product of a DNA 
expression vector comprising SEQ ED N0:1 and contained within a recombinant host 
cell. 

15 22. A method of identifying modulators of NHL activity, comprising: 

(a) combining a test compound with a NHL protein, wherein NHL comprises 
the amino acid sequence as set forth in SEQ ID N0:2; and, 

(b) measuring the effect of the test compound on the NHL protein. 

20 23. An isolated DNA molecule which comprises the nucleotide sequence 

as set forth in SEQ ID N0:3. 

24. An isolated DNA molecule of claim 20 which comprises from about 
nucleotide 47000 to about nucleotide 85500 of SEQ ID NO:3. 

25 . 

25. An isolated DNA molecule of claim 23 which comprises from about 
nucleotide 47095 to about nucleotide 853 16 of SEQ E) N0:3. 

26. A substantially purified NHL protein of claim 21 wherein said protein 
30 is a product of a DNA expression vector comprising from about nucleotide 828 to 

nucleotide 4587, as set forth in SEQ ID N0:1, and contained within a recombinant 
host cell. 
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1/12 



AGTCAGCCCT GCT6CCAGCC AGT6CCGGGT 6CTGGGGACT CAGGGAGGCC CGCCGGGACC ACTGCGGGAC 
Ar-rrirrrRA RTAGAAGCTG GAACGCAGGA GAGGAAGGA6 AGGGGGCGGT CAGGGCTCTC AGGAGCCGGG 
IJgCAGCCG TTTTCAAATT TTCA6GAAAG CGGTCGGCTC ACACTCGAGC AGTAAAAAGA 
S S G^^S Sci CCGGGCAATG GTGGTGGCTC GGCCTAGAGA GGCGGWG 
GAACGCAGAC CCTGGTG6GG 6AATGACATC AAGGGA66AG ACGGGCGGGA CCCCAGATTT CTGCCTGTGG 
rTRATGGMG TGAGGTTCAC TGGCCAGCGG A6CCGGACAC AGAACGCGCA AAACGCCGTG TA6GCCT66A 
S^JrSir AGCAGGCGGA CCCCCTCC6C GGGGGAACAG TTTCCGCCGG GAGCACAAAG CAACGGACC6 
S gS?^ Sca GCGCCGACTG CGCGCCTCTG CCCGCGAAAA CTCm^T 
GCTGACA6CT GGGGACGG6T GGCGGCCCTC GACTGGA6TC GGTTGAGTTC CTGAGGGACC CCGGTTCT6G 
SgGHCGCC GCGGAGACAA GTGAGCAGTC TGTGCCATAG GGATTCTCGA AGA6AACA6C GTTCT6TCCC 
ArTrnriTr rTPRrATCGC TTACCAGGAG TGCCCGAGAC CCTAA6ATGT TCG6AGTGGT TTTllLbLAt 

^? Sctgcccc tcagccacgc tctgtgccct tctgagaaca ggctgataig cccaagatag 
Scttccctt tccagcccta caaatgccaa caggagtaca tgaccaaggt 
S Skctggag agccctacgg gtacagggaa gacgctgjgc 

rTrrrrrrrn rrACGCTGGC CTGGCGAGAA CACCTCCGAG ACGGCATCTC TGCCCGCAAG ATTGCCfaAbA 

Skgss ccttgtcatc ctggggcmc gct^tgctg 

S IS «S S Sc^3 

JJ^SS^A GTAACCATCT ACAGATCCAC TTGTGCCGTA A^^ 
AAGTCGCTCC TGTCATTTa ACAACAACGT AGAAGAAAAA AGCCTOAGC *^TGGC "^^^'^ 
CTGGACAHG AGGACTTGGT CAAGAGCGGA AGCAAGCACA GGGTCTGCCC TTACTACCTG TKCffiWtt 
TCAAGTAGCA AGCCGACATC ATAHCATGC CGTACAATTA CnGTTGGAT GCCAAGAfiCC GCAGAECALA 

cSS Sgtgat chtgacgaa gctcacaacg tggagaagat gtgtga^ 

TCGGCATCCT TTGACCTGAC TCCCCATGAC CTGGCTTCAG GACTGGACGT CATA6ACCAG ff[GCT|AGG 
AGCAGACCAA GGCAGCGCAG CAGGGTGAGC CCCACCCGGA GTTCAGCGCG GACTCCCCCA GCCCAffiGCT 
GAACATGGAG CTGGAAGACA TTGCAAAGCT GAAGATGATC CTGCTGCGCC TGGAGGGGGC CATCGATGCT 
GHGAGCTGC CTGGAGACGA CAGCGGTGTC ACCAAGCCAG GGAGCTACAT CTTTGAGCTG TTTGCTGAAG 
CCCA6ATCAC GTTTCAGACC AAGGGCTGCA TCCTGGACTC GCTGGACCAG ATCATCCAGC ACCTGGCAGG 
ACGT6CTGGA GTGTTCACCA ACACG6CCGG ACTGCAGAAG CTGGCGGACA TTATCCA6AT TGTGTO6T 
GTGGACCCCT CCGAGGGCAG CCCTGGTTCC CCAGCAGGGC TGGGGGCCTT ACAGTCCTAT AAGGT6CACA 

?™gI Sac cggaggacgg ctcagcggtc tgatgcct^ agcaccactg cagc agam 

GCGAGGGAAG GTGCTGAGCT ACTGGTGCTT CAGTCCCGGC CACAGCATGC ACGAGCTGGT CCGCCAGG6C 
GTCCGCTCCC TCATCCHAC CAGCGGCACG CTGGCCCCGG TGTCCTCCTT TGCTCT66AG ATGCAGATCC 
CTTTCCCAGT CTGCCTGGAG AACCCACACA TCATCGACAA GCACCAGATC TGG6TGGGGG TC6TCCCCA6 
AGGCCCCGAT GGAGCCCAGT TGAGCTCCGC GTTTGACAGA CGGTTTTCCG AG6AGTGCTT ATCCTCCCTG 
GGGAAG6CTC TGGGCAACAT CGCCCGCGTG GTGCCCTATG GGCTCCTGAT CTTCTTCCCT TCCTATCCT6 
TCAT6GAGM GAGCCT6GA6 TTCTGGCGGG CCCGCGACTT GGCCA6GAAG ATGGAGGCGC TGMGCCGCT 
GTTT6TGGAG CCCAGGAGCA AAGGCAGCH CTCCGAGACC ATCAGTGCTT ACTATGCAAG GGTT6CCGCC 
CCTG66TCCA CCGGCGCCAC CHCCIGGCG GTCTGCCGGG GCAAG6CCAG CGA6GG6CTG GACTTCTCA6 
ACACGAATGG CCGTGGTGTG AHGTCACGG GCCTCCCGTA CCCCCCACGC ATG6ACCCCC GGGTTGTCCT 
CAAGATGCAG TTCCTGGATG AGATGAAGGG CCAGGGTGGG GCTGGGGGCC A6TTCCTCTC TGGGCAG6AG 
TGGTACCGGC AGCAGGCGTC CAGGGCTGTG AACCAGGCCA TCGGGCGAGT GATCCGGCAC CGCCAGGACT 
ACGGAGCTGT CTTCCTCTGT GACCACAGGT TCGCCTTTGC CGACGCAAGA GCCCAACTGC CCTCCTGGCT 
GCGTCCCCAC GTCAGGGTGT ATGACAACH TGGCCATGTC ATCCGAGACG TGGCCCAGTT CTTCCGTCTT 
GCCGAGCGAA CTATGCCAGC GCCGGCCCCC CGGGCTACAG CACCCAGTGT GCGTGGAGAA GATGCTGTCA 
GCGAGGCCAA GTCGCCTGGC CCCTTCnCT CCACCAGGAA AGCTAAGAGT CTGGACCTGC ATGTCCCCAG 
CCTGAA6CAG AGGTCCTCAG GGTCACCAGC TGCCGGGGAC CCCGAGAGTA GCCTGTGTGT GGAGTATGAG 
CAGGAGCCAG TTGCTGCCGG GCAGAGGCCC AGGGGGCTGC TGGCCGCCCT GGAGCACAGC GAAGAGCGGG 

FIG.1A 
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CGGGGAGCCC TGGCGA6GAG CAGGCCCACA GCTGCTCCAC CCTGTCCCTC CTGTCTGAGA 
AGAGGCCGGC AGAAGAACCG CGAGGAGGGA GGAAGAAGAT OGGCT GTC AGCCACCCGG 
A6GAGCCCGT GGCTGGTGCA CAGACGGACA GGGCCAAGCT CnCATGGTG GCCGTGMGC 
CCAAGCCAAC mGCCACCT TCACCCAGGC CCTCCAGGAC TA AA GGTT 
CCGATGACn GGCCGCCCTG GCCGCCTGTC TCGGCCCCCT CTTT6CTGAG GACCCCAAGA 
AGCACAACCT GCTCCAAGGC HCTACCAGT HGIGCGGCC CCACCATAAG CAGCA6TTTG 
WCCAGCTG ACAGGACGAG GCTGTGGCTA TCGGCCTGAG CACA^CAm 
CCCGAAG6CA 6CGGGCACA6 CCGGTCCTGG ACCCCACT6G AAGAACGGCG CC66ATCCCA 
AGCTGACCGT GTCCACGGCT GCAGCCCAGC AGCTGGACCC CCAAGAGCAC CTGAACCAG6 
ScCA CCTGTCGCCC AGGCCACCCC CAACAGGAGA CCCTGGC^^^ m^^Ag 
GGGGGTCTGG AGTGCCCAGA GCAGGGAAGC AG6GCCAGCA CGCCGTGAGC 6CCTACCTGG 
CTGAT6CCCG CAGGGCCCTG 6GGTCCGCGG GCTGTAGCCA ACTCTTGGCA GCGCTGACAG 
CCWAAGCA AGACGACGAC CTCGACAAGG TGCTGGCTGT GTreGCCGCC CTGACCACTG 
CAAAGCCAGA GGACHCCCC CTGCTGCACA GGHCAGCAT GTTTGTGCGT CCACACCACA 
^CAGACG TGCACAGACC TGACCGGCCG GC^^^^^^^^ GGCA G^G^^ 
CACCGGGACC CCAGGAGGAG AGGCHGCCG T6CCTCCTGT GCnACCCAC AG6GCTCCCC 
AACCAGGCCC CTCAC6GTCC GAGAAGACCG GGAAGACCCA GAGCAA6ATC TCGTCCTTCC 
HAGACAGAG GCCAGCAGGG ACTGTGG6G6 CGG6C6GTGA GGAT6CA6GT CCCAGCCA6T 
CCTCAGGACC TCCCCACGGG CCTGCAGCAT CTGAGT6GGG CCTCIAGGAT 6TGCCCA6CC 
TGCCACACCG CCTCCAGGAA GCAGAGCGTC ATGCAGGTCT TCTG6CCA6A GCCCCAGT6A 
GTGCCCACGG AGGCCCCCAG CACACCCAAC GT6GCTTGAT GACCTGCCTG TCCAGCTCTG 
GTGGGCCAAG AACCCACCCA ACAGAATA6G CCAGCCCATG CCAGCCGGCT TGGCCG6CTG 
CA6GCCTCAG GCAGGCGG6G CCCAT6GTTG GTCCCTGC66 TGGGACC6GA TCTGGGCCTG 
CCTCTGAGAA GCCCTGAGCT ACCTTGGGGT CTGGGGTGGG TTTCTGGGAA AGIGCHCCC 
CAGAACTTCC CTGGCTCCTG GCCTGTGAGT GGTGCCACAG G6GCACCCCA GCTGAGCCCC 
TCACGGGGAA GGAGGAGACC CCCGTGGGCA CGTGTCCACT TTTAATCAGG GGACAGGGCT 
CTCTAATAAA GCTGCTGGCA GTGCCC (SEQ ID N0:1). 
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MPKIVLNGVT VDFPFQPYKC QQEYKTKVLE CLQQKVNGIL ESPTGTGKTL CLLCTTLAWR 
EHLRDGISAR KIAERAQGEL FPDRALSSWG NAAAAAGDPI ACYTDIPKII YASRTHSQLT 
QOELRNTS YRPKVCVLGS REQLCIHPEV KKQESNHLQI HLCRKKVASR SCHFYNNVEE 
KSLEQEL^P ILDIEDLVKS GSKHRVCPYY LSRNLKQQAD IIFMPYNYLL DAKSRRAHNI 
DLKGTVvfFD EAHNVEKMCE ESASFDLTPH DLASGLDVID QVLEEQTKAA QQGEPHPEFS 
"gL M ELEDIAKLKM ILLRLEGAID AVELPGDDSG VTKPGSYIFE LFAEAQITFQ 
TKGCILDSLD QIIQHLAGRA GVFTNTAGLQ KLADIIQIVF SVDPSEGSPG SPAGLGALQS 

Shpdag 2™a wsttaarkrg kvlsywcfsp ^mR^ m^s^ 

TUPVSSFAL EMQIPFPVCL ENPHIIDKHQ IWVGVVPRGP DGAQLSSAFD RRFSEECLSS 

lgkalgniar vvpyglliff psypvmeksl efwrardur kmealkplfv eprskgsfse 
tSyarva apgstgatfl avcrgkaseg ldfsdtngrg vivtglpypp rmdprvvlkm 
QFLDEMKGQG gaggqflsgq ewyrqqasra vnqaigrvir hrqdygavfl cmmm 
mPSWVRP hvrvydnfgh virdvaqffr vaertmpapa pratap vrg dav eaksp 
gpffstrkak sldlhvpslk qrssgspaag dpesslcvey eqepvparqr prgllmleh 

SEQRAGSPGE EQAHSCSTLS LLSEKRPAEE PRG6RKKIRL VSHPEEPVAG AQTDRAKLFM 
VAVKQELSQA NFATFTQALQ DYKGSDDFAA LAACLGPLFA EDPKKHNLLQ GFYQFVRPHH 

kqqfeevciq ltgrgcgyrp ehsiprrqra qpvldptgrt apdpkltvst aaaqqldpqe 

HLNQGRPHLS PRPPPTGDPG SQPQWGS6VP RAGKQGQHAV SAYLADARRA LGSA6CSQLL 
AALTAYKQDD DLDKVUVU ALHAKPEDF PLLHRFSMFV RPHHKQRFSQ TCTDLTGRPY 
PGMEPPGPQE ERLAVPPVLT HRAPQPGPSR SEKTGKTQSK ISSFLRQRPA GTVGAGGEDA 
GPSQSSGPPH 6PAASEWGL* (SEQ ID N0:2). 

FIG. 2 
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SEQUENCE LISTING 



<110> APPLICANT: Merck fit Co., Inc. 

<120> TITLE: DNA MOLECULES ENCODING HUMAN NHL. A DNA 
HELICASE 

<130> DOCKET/FILE REFERENCE: 205S5 PCT 
<160> NUMBER OF SEQUENCES: 38 

<170> SOFTWARE: FastSEQ for Windows Version 4.0 

<210> SEQ ID N0:1 

<211> LENGTH: 4946 

<212> TYPE: DNA 

<213> ORGANISM: Homo sapien 

<220> FEATURE: 

<221> NAME/KEY: CDS 

<222> LOCATION: (828) . . . (4487) 

<400> SEQ ID N0:1 

agtcagccct gctgccagcc agtgccgggt gctggggact cagggaggcc cgccgggacc 60 

actgcgggac agtgagccga gcagaagctg gaacgcagga gaggaaggag agggggcggt 120 

cagggctctc aggagccggg tcctgggcaa ggcgcagccg ttttcaaatt Ctcaggaaag 180 

cggtcggctc acactcgagc agtaaaaaga tgcctctggg gaggaggccc gtgcagctct 240 

ccgggcaatg gtggtggctc ggcctagaga ggcggtagtg gaacgcagac cctggtgggg 300 

gaatgacatc aagggaggag acgggcggga ccccagattt ctgcctgtgg gcgatggaag 360 

tgaggttcac tggccagcgg agccggacac agaacgcgca aaacgccgtg. taggcctgga 420 

ggagccgaag agcaggcgga ccccctccgc gggggaacag tttccgccgg gagcacaaag 480 

caacggaccg gaagtggggg gcggaagtgc agtgggctca gcgccgactg cgcgcctctg 540 

cccgcgaaaa ctctgagctg gctgacagct ggggacgggt ggcggccctc gactggagtc 600 

ggttgagttc ctgagggacc ccggttctgg aaggttcgcc gcggagacaa gtgagcagtc 660 

tgtgccatag ggat'tctcga agagaacagc gttgtgtccc agtgcacatg ctcgcatcgc 720 

ttaccaggag tgcccgagac cctaagatgt tcggagcggt tttttcgcac agacccgaac 780 

agcctgcccc tcagccacgc tctgtgccct tctgagaaca ggctgat atg ccc aag 83 6 



Met Pro Lys 
1 



ata gtc ctg aat ggt gtg acc gta gac etc cct ttc cag ccc tac aaa 
lie Val Leu Asn Gly Val Thr Val Asp Phe Pro Phe Gin Pro Tyr Lys 
5 10 15 . 



884 



tgc caa cag gag tac atg acc aag gtc ctg gaa tgt ctg cag cag aag 
Cys Gin Gin Glu Tyr Met Thr Lys Val Leu Glu Cys Leu Gin Gin Lys 
20 25 30 35 



932 



gtg aat ggc ate ctg gag age cct acg ggt aca ggg aag acg ctg tgc 
Val Asn Gly He . Leu Glu Ser Pro Thr Gly Thr Gly Lys Thr Leu Cys 
40 45 50 



980 



ctg ctg tgc acc acg ctg gcc tgg cga gaa cac etc cga gac ggc ate 
Leu Leu Cys Thr Thr Leu Ala Trp Arg Glu His Leu Arg Asp Gly He 
55 60 65 



1028 



tct gcc cgc aag att gcc gag agg- gcg eaa gga gag ctt ttc ccg gat 
Ser Ala Arg Lys He Ala Glu Arg Ala Gin Gly Glu Leu Phe Pro Asp 
70 75 80 



1076 
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egg gcc ttg tea tec tgg ggc aac get get get get get gga gac ccc 1124 
Arg Ala Leu Ser Ser Trp Gly Asn Ala Ala Ala Ala Ala Gly Asp Pro 
85 90 95 

ata get tgc tac aeg gac ate eca aag att att tac gcc tec agg aec 1172 
He Ala Cys Tyr Thr Asp He Pro Lys He He Tyr Ala Ser Arg Thr 
100 105 110 115 

eac teg caa etc aea eag gtc ate aac gag ett egg aac acc tec tac 1220 
His Ser Gin Leu Thr Gin Val He Asn Glu Leu Arg Asn Thr Ser Tyr 
120 125 130 

. egg ect aag gtg tgt gtg etg ggc tee egg gag eag etg tgc ate cat 1268 
Arg Pro Lys Val Cys Val Leu Gly. Ser Arg Glu Gin Leu Cys He His 
135 140 145 

cet gag gtg aag aaa caa gag agt aac cat eta cag ate eac ttg tgc 1316 
Pro Glu Val Lys Lys Gin Glu Ser Asn His Leu Gin He His Leu Cys 
150 155 160 

cgt aag aag gtg gca agt cgc tec tgt cat ttc tac aac aac gta gaa 1364 
Arg Lys Lys Val Ala Ser Arg Ser Cys His Phe Tyr Asn Asn Val Glu 
165 170 175 

gaa aaa age etg gag eag gag etg gcc age ccc ate etg gac att gag 1412 
Glu Lys Ser Leu Glu Gin Glu Leu Ala Ser Pro He Leu Asp He Glu 
180 185 190 195 

gac ttg gtc aag age gga age aag cac agg gtg tgc cet tac tac etg 1460 
Asp Leu Val Lys Ser Gly Ser Lys His Arg Val Cys Pro Tyr Tyr Leu 
200 205 210 

tec egg aac etg aag cag caa gcc gac ate ata ttc atg ccg tac aat 1508 
Ser Arg Asn Leu Lys Gin Gin Ala Asp He He Phe Met Pro Tyr Asn 
215 220 225 

tac ttg ttg gat gcc aag age cgc aga gca cac aac att gac etg aag 1556 
Tyr Leu Leu Asp Ala Lys Ser Arg Arg Ala His Asn He Asp Leu Lys 
230 235 240 

ggg aca gtc gtg ate ttt gac gaa get cac aac gtg gag aag atg tgt 1604 
Gly Thr Val Val He Phe Asp Glu Ala His Asn Val Glu Lys Met Cys 
245 250 255 



gaa gaa teg gca tec ttt gac etg act ccc cat gac etg get tea gga 
Glu Glu Ser Ala Ser Phe Asp Leu Thr Pro His Asp Leu Ala Ser Gly 
260 265 270 275 



1652 



etg gae gtc ata gac cag gtg etg gag gag eag acc aag gca gcg cag 1700 
Leu Asp Val He Asp Gin Val Leu Glu Glu Gin Thr Lys Ala Ala Gin 
280 285 290 

eag ggt gag ccc cac ceg gag ttc age gcg gac tee ccc age eca ggg 1748 
Gin Gly Glu Pro His Pro Glu Phe Ser Ala Asp Ser Pro Ser Pro Gly 
295 300 305 

etg aac atg gag etg gaa gac att gea aag etg aag atg ate etg etg 1796 
Leu Asn Met Glu Leu Glu Asp He Ala Lys Leu Lys Met He Leu Leu 
310 315 320 
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cgc ctg gag ggg gcc ate gat get gtt gag ctg cct gga gac gac age 1844 
Arg Leu Glu Giy Ala lie Asp Ala Val Glu Leu Pro Gly Asp Asp Ser 
325 330 335 

ggt gtc aec aag cca ggg age tae ate ttt gag ctg ttt get gaa gcc 1892 
Gly Val Thr Lys Pro Gly Ser Tyr He Phe Glu Leu Phe Ala Glu Ala 
340 345 350 355 

eag ate acg ttt cag acc aag ggc tgc ate ctg gac teg ctg gae eag 1940 
Gin lie Thr Phe Gin Thr Lys Gly Cys He Leu Asp Ser Leu Asp Gin 
360 365 370 

ate ate cag cae etg gca gga cgt get gga gtg tte ace aac acg gee 1988 
He lie Gin His Leu Ala Gly Arg Ala Gly Val Phe Thr Asn Thr Ala 
375 380 385 

gga etg cag aag etg geg gae att ate cag att gtg tte agt gtg gae 2036 
Gly Leu Gin Lys Leu Ala Asp He He Gin He Val Phe Ser Val Asp 
390 395 400 

ccc.tee gag ggc age eet ggt tec cca gca ggg ctg ggg gee tta cag 2084 
Pro Ser Glu Gly Ser Pro Gly Ser Pro Ala Gly Leu Gly Ala Leu Gin 
405 410 415 

tee tat aag gtg cae ate cat cct gat get ggt cae egg agg acg get 2132 
Ser Tyr Lys Val His He His Pro Asp Ala Gly His Arg Arg Thr Ala 
420 425 430 435 

eag egg tet gat gee tgg age acc act gca gee aga aag ega ggg aag 2180 
Gin Arg Ser Asp Ala Trp Ser Thr Thr Ala Ala Arg Lys Arg Gly Lys 
440 445 450 

gtg ctg age tae tgg tgc tte agt ecc gge cae age atg cae gag etg 2228 
Val Leu Ser Tyr Trp Cys Phe Ser Pro Gly His Ser Met His Glu Leu 
455 460 465 

gtc cge cag gge gtc cge tee etc ate ctt aee age gge. aeg etg gcc 2276 
Val Arg Gin Gly Val Arg Ser Leu He Leu Thr Ser Gly Thr Leu Ala 
470 475 480 

ccg gtg tee tec ttt get etg gag atg eag ate cct tte eea gtc tgc 2324 
Pro Val Ser Ser Phe Ala Leu Glu Met Gin He Pro Phe Pro Val Cys 
485 490 495 

etg gag aac cca cae ate ate gae aag cae cag ate tgg gtg ggg gtc 2372 
Leu Glu Asn Pro His He He Asp Lys His Gin He Trp Val Gly Val 
500 505 510 515 

gtc ccc aga gge cee gat gga gee eag ttg age tec geg ttt gae aga 2420 
Val Pro Arg Gly Pro Asp Gly Ala Gin Leu Ser Ser Ala Phe Asp Arg 
520 525 530 

egg ttt tec gag gag tgc tta tec tee etg ggg aag get ctg gge aac 2468 
Arg Phe Ser Glu Glu Cys Leu Ser Ser Leu Gly Lys Ala Leu Gly Asn 
535 540 545 

ate gcc cgc gtg gtg ccc tat ggg etc etg ate tte tte eet tec tat 2516 
He Ala Arg Val Val Pro Tyr Gly Leu Leu He Phe Phe Pro Ser Tyr 
550 555 560 



3 
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cct gtc atg gag aag age ctg gag ttc tgg egg gcc cgc gac ttg gcc 2564 
Pro Val Met Glu Lys Ser Leu Glu Phe Trp Arg Ala Arg Asp Leu Ala 
565 570 575 

agg aag atg gag gcg ctg aag ccg ctg ttt gtg gag ccc agg age aaa 2612 
Arg Lys Met Glu Ala Leu Lys Pro Leu Phe Val Glu Pro Arg Ser Lys 
580 585 590 595 

ggc age ttc tee gag aec ate agt get tae tat gea agg get gee gcc 2660 
Gly Ser Phe Ser Glu Thr lie Ser Ala Tyr Tyr Ala Arg Val Ala Ala 
600 605 610 

cet ggg tee ace ggc gee aec ttc ctg gcg gtc tgc egg ggc aag gcc 2708 
Pro Gly Ser Thr Gly Ala Thr Phe Leu Ala Val Cys Arg Gly Lys Ala 
615 620 625 

,agc gag ggg ctg gac ttc tea gac aeg aat ggc cgt ggt gtg att gtc 2756 
Ser Glu Gly Leu Asp Phe Ser Asp Thr Asn Gly Arg Gly Val lie Val 
630 635 640 

aeg ggc etc ccg tac cec cca cgc atg gac eec egg gtt gtc etc aag 2804 
Thr Gly Leu Pro Tyr Pro Pro Arg Met Asp Pro Arg Val Val Leu Lys 
645 650 655 

atg cag ttc ctg gat gag atg aag ggc eag ggt ggg get ggg ggc eag 2852 
Met Gin Phe Leu Asp Glu Met Lys Gly Gin Gly Gly Ala Gly Gly Gin 
660 665 670 675 

ttc etc tet ggg cag gag tgg tae egg eag cag gcg tee agg get gtg 2900 
Phe Leu Ser Gly Gin Glu Trp Tyr Arg Gin Gin Ala Ser Arg Ala Val 
680 685 690 

aae cag gee ate ggg cga gtg ate egg cac cgc cag gac tae gga get 2948 
Asn Gin Ala He Gly Arg Val He Arg His Arg Gin Asp Tyr Gly Ala 
695 700 705 

gtc ttc etc tgt gac eac agg ttc gee ttt gcc gac gea aga gcc caa 2996 
Val Phe Leu Cys Asp His Arg Phe Ala Phe Ala Asp Ala Arg Ala Gin 
710 715 720 

ctg ece tec tgg gtg cgt ccc eae gtc agg gtg tat gae aac ttt ggc 3044 
Leu Pro Ser Trp Val Arg Pro His Val Arg Val Tyr Asp Asn Phe Gly 
725 730 735 

cat gtc ate cga gac gtg gcc cag ttc ttc cgt gtt gcc gag cga act 3092 
His Val He Arg Asp Val Ala Gin Phe Phe Arg Val Ala Glu Arg Thr 
740 745 750 755 

atg cca gcg ccg gcc ccc egg get aca gea eec agt gtg cgt gga gaa 3140 
Met Pro Ala Pro Ala Pro Arg Ala Thr Ala Pro Ser Val Arg Gly Glu 
760 765 770 

gat get gtc age gag gee aag teg cct ggc ccc ttc ttc tee aec agg 3188 
Asp Ala Val Ser Glu Ala Lys Ser Pro Gly Pro Phe Phe Ser Thr Arg 
775 780 785 

aaa get aag agt ctg gac ctg cat gtc ccc age ctg aag cag agg tee 3236 
Lys Ala Lys Ser Leu Asp. Leu His Val Pro Ser Leu Lys Gin Arg Ser 
790 795 800 



4 
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tea ggg tea cca get gcc ggg gac ccc gag agt age ctg tgt gtg gag 3284 
Ser Gly Ser Pro Ala Ala Gly Asp Pro Glu Ser Ser Leu Cys Val Glu 
805 810 815 



cat gag cag gag cca gtt cct gcc egg cag agg ccc agg ggg ctg ctg 3332 
Tyr Glu Gin Glu Pro Val Pro Ala Arg Gin Arg Pro Arg Gly Leu Leu 
820 825 830 835 



gcc gcc ctg gag cac age gaa cag egg gcg ggg age cct ggc gag gag 33 80 

Ala Ala Leu Glu His Ser Glu Gin Arg Ala Gly Ser Pro Gly Glu Glu 

840 845 850 

cag gcc cac age tgc tec aec ctg ccc etc ctg tct gag aag agg ccg 3428 

Gin Ala His Ser Cys Ser Thr Leu Ser Leu Leu Ser Glu Lys Arg Pro 

855 860 865 



gca gaa gaa ccg cga gga 

. Ala Glu Glu Pro Arg Gly 
870 

ccg gag gag ccc gtg get 

Pro Glu Glu Pro Val Ala 
885 

atg gtg gcc gtg aag cag 

Met Val Ala Val Lys Gin 

900 905 



ggg agg aag aag ate egg 
Gly Arg Lys Lys lie Arg 
875 

ggt gea cag acg gac agg 
Gly Ala Gin Thr Asp Arg 
890 895 

gag ttg age caa gee aac 
Glu Leu Ser Gin Ala Asn 
910 



ctg gte age cac 3476 

Leu Val Ser His 

880 

gcc aag etc ttc 3524 
Ala Lys Leu Phe 



ttt gcc ace ttc 3572 
Phe Ala Thr Phe 
915 



ace cag gcc ctg cag gac tac aag ggt tee gat gac ttc gee gcc ctg 3620 

Thr Gin Ala Leu Gin Asp Tyr Lys Gly Ser Asp Asp Phe Ala Ala Leu 

920 925 930 

gcc gee tgt etc ggc ccc etc ttt get gag gac ccc aag aag cac aac 3668 

Ala Ala Cys Leu Gly Pro Leu Phe Ala Glu Asp Pro Lys Lys His Asn 
935 940 945 

ctg etc caa ggc ttc tac cag ttt gtg egg ccc cac cat aag cag cag 3716 

Leu Leu Gin Gly Phe Tyr Gin Phe Val Arg Pro His His Lys Gin Gin 
950 955 960 

ttt gag gag gte tgt ate cag ctg aea gga cga ggc tgt ggc tat egg 3764 

Phe Glu Glu Val Cys lie Gin Leu Thr Gly Arg Gly Cys Gly Tyr Arg 

965 970 975 



cct gag cac age att ccc cga agg cag egg gea cag ccg gte ctg gac 3812 
Pro Glu His Ser He Pro Arg Arg Gin Arg Ala Gin Pro Val Leu Asp 
980 ^ 985 990 995 



ccc act gga aga acg gcg ccg gat ccc aag ctg aec gtg tec acg get 3860 

Pro Thr Gly Arg Thr Ala Pro Asp Pro Lys Leu Thr Val Ser Thr Ala 
1000 1005 1010 

gca gcc cag cag ctg gac ccc caa gag cae ctg aac cag ggc agg ccc 3908 

Ala Ala Gin Gin Leu Asp Pro Gin Glu His Leu Asn Gin Gly Arg Pro 

1015 1020 1025 



cac ctg teg cec agg cca ccc eea aea gga gac cct ggc age caa cea 3956 
His Leu Ser Pro Arg Pro Pro Pro Thr Gly Asp Pro Gly Ser Gin Pro 
1030 1035 1040 . 



5 
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cag tgg ggg tct gga ,gtg ccc aga gca ggg aag cag ggc cag cac gcc 
Gin Trp Giy Ser Gly Val Pro Arg Ala Gly Lys Gin Gly Gin His Ala 
1045 1050 1055 



4004 



gtg age gcc tac ctg get gat gcc cgc agg gcc ctg ggg tec gcg ggc 4052 
Val Ser Ala Tyr Leu Ala Asp Ala Arg Arg Ala Leu Gly Ser Ala Gly 
1060 1065 1070 1075 

tgt age eaa etc ttg gea gcg ctg aca gee tat aag caa gac gae gac 4100 
Cys Ser Gin Leu Leu Ala Ala Leu Thr Ala Tyr Lys Gin Asp Asp Asp 
1080 1085 1090 

etc gac aag gtg ctg get gtg ttg gcc gee ctg ace act gca aag eea 4148 
Leu Asp Lys Val Leu Ala Val Leu Ala Ala Leu Thr Thr Ala Lys Pro 
1095 1100 1105 

gag gae tte ccc ctg ctg cae agg ttc age atg ttt gtg egt eea cac 4196 
Glu Asp Phe Pro Leu Leu His Arg PheSer Met Phe Val Arg Pro His 
1110 1115 1120 

cae aag cag egc tte tea cag aeg tge aca gac ctg acc gge egg ccc • 4244 
His Lys Gin Arg Phe Ser Gin Thr Cys Thr Asp Leu Thr Gly Arg Pro 
1125 1130 1135 



tac ceg gge atg gag eea ccg gga ccc cag gag gag agg ctt gcc gtg 
Tyr Pro Gly Met Glu Pro Pro Gly Pro Gin Glu Glu Arg Leu Ala Val 
1140 1145 1150 1155 



4292 



cct cet gtg ctt ace cae agg get ccc eaa eea gge ccc tea egg tee 4340 
Pro Pro Val Leu Thr His Arg Ala Pro Gin Pro Gly Pro Ser Arg Ser 
1160 1165 1170 

gag aag ace ggg aag aee cag age aag ate teg tec ttc ctt aga cag 4388 
Glu Lys Thr Gly Lys Thr Gin Ser Lys He Ser Ser Phe Leu Arg Gin 
1175 1180 1185 

agg cea gca ggg act gtg ggg gcg gge ggt gag gat gca ggt ccc age 4436 
Arg Pro Ala Gly Thr Val Gly Ala Gly Gly Glu Asp Ala Gly Pro Ser 
1190 1195 1200 

eag tee tea gga cct ccc cae ggg cet gca gea tet gag tgg ggc etc 4484 
Gin Ser Ser Gly Pro Pro His Gly Pro Ala Ala Ser Glu Trp Gly Leu 
1205 1210 1215 

tag gatgtgccea gectgeeaea ccgcctccag gaagcagagc gteatgeagg 4537 



tcttctggec agageeccag tgagtgecca cggaggecce cagcaeaccc aacgtggctt 4597 

gatcacctge ctgtccagct ctggtgggcc aagaacccac ceaacagaat aggccagccc 4657 

atgccagccg gcttggcccg ctgcaggcct eaggcaggcg gggcceatgg ttggtccctg 4717 

cggtgggaec ggatctgggc ctgcetetga gaagccetga gctaecttgg ggtctggggt 4777 

gggtttetgg gaaagtgctt ceccagaact tecctggctc etggectgtg agtggtgcca 4837 

eaggggcaec ceagctgagc ecctcacegg gaaggaggag aecceegtgg gcacgtgtce 4897 

aettttaate aggggaeagg gctetetaat aaagctgctg gcagtgcce 4946 
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<210> SEQ ID NO: 2 

<211> LENGTH: 1219 

<212> TYPE: PRT 

<213> ORGANISM :Homo sapien 



<400> SEQ ID NO: 2 



Met 


Pro 


Lys 


lie 


Val 


Leu 


Asn Gly Val 


1 








5 










Pro 


Tyr 


Lys 


Cys 


Gin 


Gin 


Glu Tyr Met 








20 










25 


Gin- 


Gin 


Lys 


Val 


Asn 


Gly 


He 


Leu 


Glu 






35 










40 




Thr 


Leu 


Cys 


Leu 


Leu 


Cys 


Thr 


Thr 


Leu 




50 










55 






Asp 


Gly 


lie 


Ser 


Ala 


Arg 


Lys 


He 


Ala 


65 










70 








Phe 


Pro 


Asp 


Arg 


Ala 


Leu 


Ser 


Ser 


Trp 










85 










Gly 


Asp 


Pro 


He 


Ala 


Cys 


Tyr 


Thr 


Asp 








100 










105 


Ser 


Arg 


Thr 


His 


Ser 


Gin 


Leu 


Thr 


Gin 






115 










120 




Thr 


Ser 


Tyr 


Arg 


Pro 


Lys 


Val 


Cys 


val 




130 










135 






Cys 


lie 


His 


Pro 


Glu 


Val 


Lys 


Lys 


Gin 


145 










150 








His 


Leu 


Cys 


Arg 


Lys 


Lys 


Val 


Ala 


Ser 










165 










Asn 


Val 


Glu 


Glu 


Lys 


Ser 


Leu 


Glu 


Gin 








180 










185 


Asp 


lie 


Glu 


Asp 


Leu 


Val 


Lys 


Ser 


Gly 






195 










200 




Tyr 


Tyr 


Leu 


Ser 


Arg 


Asn 


Leu 


Lys 


Gin 




210 










215 






Pro 


Tyr 


Asn 


Tyr 


Leu 


Leu 


Asp 


Ala 


Lys 


225 










230 








Asp 


Leu 


Lys 


Gly 


Thr 


Val 


Val 


He 


Phe 










245 










Lys 


Met 


Cys 


Glu 


Glu 


Ser 


Ala 


Ser 


Phe 








260 










265 


Ala 


Ser 


Gly 


Leu 


Asp 


Val 


He 


Asp 


Gin 






275 










280 




Ala 


Ala 


Gin 


Gin 


Gly 


Glu 


Pro 


His 


Pro 




290 










295 






Ser 


Pro 


Gly 


Leu 


Asn 


Met 


Glu 


Leu 


Glu 


305 










310 








lie 


Leu 


Leu 


Arg 


Leu 


Glu 


Gly Ala 


He 










325 










Asp 


Asp 


Ser 


Gly 


Val 


Thr 


Lys 


Pro Gly 








340 










345 


Ala 


Glu 


Ala 


Gin 


He 


Thr 


Phe 


Gin 


Thr 






355 










360 




Leu 


Asp 


Gin 


He 


He 


Gin 


His 


Leu 


Ala 




370 










375 






Asn 


Thr 


Ala 


Gly 


Leu 


Gin 


Lys 


Leu 


Ala 


385 










390 








Ser 


Val 


Asp 


Pro 


Ser 


Glu 


Gly 


Ser 


Pro 










405 










Ala. 


Leu 


Gin 


Ser 


Tyr 


Lys 


Val 


His 


He 



420 425 
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Thr 


Val 


Asp 


Phe 


Pro 


Phe 


Gin 


10 










15 




Thr 


Lys 


Val 


Leu 


Glu 


Cys 


Leu 










.30 






Ser 


Pro 


Thr 


Gly Thr Gly 


Lys 








45 








Ala 


Trp Arg 


Glu 


His. 


Leu 


Arg 






50 










Glu 


Arg 


Ala 


Gin Gly Glu 


Leu 




75 










80 


Gly Asn Ala 


Ala 


Ala 


Ala 


Ala 


90 










95 




He 


Pro 


Lys 


He 


He 


Tyr 


Ala 










110 






Val 


He 


Asn 


Glu 


Leu 


Arg 


Asn 








125 








Leu 


Gly Ser 


Arg 


Glu 


Gin 


Leu 






140 










Glu 


Ser 


Asn 


His 


Leu 


Gin 


He 




155 










160 


Arg 


Ser 


Cys 


His 


Phe 


Tyr 


Asn 


170 










175 




Glu 


Leu 


Ala 


Ser 


Pro 


He 


Leu 










190 






Ser 


Lys 


His 


Arg 


Val 


Cys 


Pro 








205 








Gin 


Ala 


Asp 


He 


He 


Phe 


Met 






220 










Ser 


Arg 


Arg 


Ala 


His 


Asn 


He 




235 










240 


Asp 


Glu 


Ala 


His 


Asn 


Val 


Glu 


250 










255 




Asp 


Leu 


Thr 


Pro 


His 


Asp 


Leu 










270 






Val 


Leu 


Glu 


Glu 


Gin 


Thr 


Lys 








285 








Glu 


Phe 


Ser 


Ala 


Asp 


Ser 


Pro 






300 










Asp 


He 


Ala 


Lys 


Leu 


Lys 


Met 




315 










320 


Asp 


Ala 


Val 


Glu 


Leu 


Pro 


Gly 


330 










335 




Ser 


Tyr 


He 


Phe 


Glu 


Leu 


Phe 










350 






Lys 


Gly Cys 


He 


Leu 


Asp 


Ser 








365 








Gly Arg Ala 


Gly Val 


Phe 


Thr 






380 










Asp 


He 


He 


Gin 


He 


Val 


Phe 




395 










400 


Gly 


Ser 


Pro 


Ala Gly Leu 


Gly 


410 










415 




His 


Pro 


Asp 


Ala Gly His 


Arg 










430 







■7 
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Arg Thr 

Arg Gly 
450 
His Glu 
455 

Thr Leu 

Pro Vai 

Val Gly 

Phe. Asp 
530 
Leu Gly 
545 

Pro Ser 

Asp Leu 

Arg Ser 

Val Ala 
610 
Gly Lys 
625 

Val He 

Val Leu 

Gly Gly 

Arg Ala 
690 
Tyr Gly 
705 

Arg Ala 

Asn Phe 

Glu Arg 

Arg Gly 
770 

Ser Thr 
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Gin Arg 

Cys Val 
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Gly Glu 
850 
Lys Arg 
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Val Ser 
Lys Leu 



Ala Gin 
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Lys Val 

Leu Val 

Ala Pro 
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Val Val 
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Arg Arg 

Asn- He 
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Lys Met 
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Gly His 
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Thr Met 
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Glu Asp 

Arg Lys 

Ser Ser 

Glu Tyr 
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Leu Ala 
835 
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Pro Ala 

His Pro 

Phe Met 
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Arg Ser 

Leu Ser 

Arg Gin 
470 
Val Ser 
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Glu Asn 

Pro Arg 

Phe Ser 

Ala Arg 
550 
Val Met 
565 

Lys Met 

Ser Phe 

Gly Ser 

Glu Gly 
630 
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Gin Phe 

Leu Ser 

Gin Ala 

Phe Leu 
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Pro Ser 
725 

Val lie 

Pro Ala 

Ala Val 

Ala Lys 
790 
Gly Ser 
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Glu Gin 

Ala Leu 

Ala His 

Glu Glu 
870 
Glu Glu 
885 

Val Ala 



Asp Ala Trp 
440 

Tyr Trp Cys 
455 

Gly Val Arg 

Ser Phe Ala 

Pro His lie 
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Gly' Pro Asp 
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Glu Glu Cys 
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Val Val Pro 

Glu Lys Ser 

Glu Ala Leu 
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Ser Glu Thr 
600 

Thr Gly Ala 
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Leu Asp Phe 
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Leu Asp Glu 
665 

Gly Gin Glu 
680 

He Gly Arg 
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Cys Asp His 

Trp Val Arg 

Arg Asp Val 
745 

Pro Ala Pro 
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Ser Glu Ala 
775 

Ser Leu Asp 

Pro Ala Ala 

Glu Pro Val 
825 

Glu His Ser 

840 
Ser Cys Ser 
855 

Pro Arg Gly 

Pro Val Ala 

Val Lys Gin 
905 



Ser Thr 

Phe Ser 

Ser Leu 
475 
Leu Glu 
490 

He Asp 

Gly Ala 

Leu Ser 

Tyr Gly 
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Leu Glu 
570 

Lys Pro 

He Ser 

Thr Phe 

Ser Asp 
635 
Pro Arg 
650 

Met Lys 

Trp Tyr 

Val He 

Arg Phe 
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Pro His 
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Ala Gin 

Arg Ala 

Lys Ser 

Leu His 
795 
Gly Asp 
810 

Pro Ala 

Glu Gin 

Thr Leu 

Gly Arg 
875 
Gly Ala 
890 

Glu Leu 
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Pro Gly 
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He Leu 

Met Gin 

Lys His 

Gin Leu 
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Ser Leu 
540 

Leu Leu 

Phe Trp 

Leu Phe 
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Leu Ala 
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860 
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Ala Arg Lys 

His Ser Met 

Thr Ser Gly 
480 

He Pro Phe 

495 
Gin He Trp 
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Ser Ser Ala 

Gly Lys Ala 

He Phe Phe 
560 

Arg Ala Arg 
575 

Val Glu Pro 
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Tyr Ala Arg 

Val Cys Arg 

Gly Arg Gly 
640 

Pro Arg Val 
655 

Gly Gly Ala 
670 

Gin Ala Ser 

Arg Gin Asp 

Ala Asp Ala 
720 

Val Tyr Asp 

735 
Arg Val Ala 
750 

Pro Ser Val 

Pro Phe Phe. 

Ser Leu Lys 
800 

Ser Ser Leu 
815 

Arg Pro Arg 
830 

Gly Ser Pro 

Leu Ser Glu 

He Arg Leu 
880 

Asp Arg Ala 
895 

Ala Asn Phe 
910 



wo 01/42434 



PCT/llS0O/33a65 



Ala Thr 


Phe 


Thr 


Gin 


Ala 
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Gin 
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Ser Asp 


Asp 


Phe 




915 










920 










925 






Ala Ala 


Leu 


Ala 


Ala 


Cys 


Leu 


Gly 


Pro 


Leu 


Phe 


Ala 


Glu Asp 


Pro 


Lys 


930 










935 










940 








Lys His 


Asn 


Leu 


Leu 


Gin 


Gly 


Phe 


Tyr 


Gin 


Phe 


Val 


Arg Pro 


His 


His 


945 








950 










955 








960 


Lys Gin 


Gin 


Phe 


Glu 


Glu 


Val 


Cys 


He 


Gin 


Leu 


Thr 


Gly Arg 


Gly 


Cys 








965 










970 








975 




Gly Tyr 


Arg 


Pro 


Glu 


His 


Ser 


He 


Pro 


Arg 


Arg 


Gin 


Arg Ala 


Gin 


Pro 






980 










985 








990 






Val Leu 


Asp 


Pro 


Thr 


Gly 


Arg 


Thr 


Ala 


Pro 


Asp 


Pro 


Lys Leu 


Thr 


Val 




995 










1000 








1005 






Ser Thr 


Ala 


Ala 


Ala 


Gin 


Gin 


Leu 


Asp 


Pro 


Gin 


Glu 


His Leu 


Asn 


Gin 


1010 








1015 








1020 






Gly Arg 


Pro 


His 


Leu 


Ser 


Pro 


Arg 


Pro 


Pro 


Pro 


Thr 


Gly Asp 


Pro 


Gly 


1025 








1030 








1035 






1040 


Ser Gin 


Pro 


Gin 


Trp 


Gly Ser Gly Val 


Pro 


Arg Ala Gly Lys 


Gin. Gly 








1045 








1050 






1055 


Gin His 


Ala 


Val 


Ser 


Ala 


Tyr 


Leu 


Ala 


Asp 


Ala 


Arg 


Arg Ala 


Leu Gly 






1060 








1065 






1070 




Ser Ala Gly 


Cys 


Ser 


Gin 


Leu 


Leu 


Ala 


Ala 


Leu 


Thr 


Ala Tyr 


Lys 


Gin 




1075 








1080 








1085 






Asp Asp 


Asp 


Leu 


Asp 


Lys 


Val 


Leu 


Ala 


Val 


Leu 


Ala 


Ala Leu 


Thr 


Thr 


1090 








1095 








1100 






Ala Lys 


Pro 


Glu Asp 


Phe 


Pro 


Leu 


Leu 


His 


Arg 


Phe 


Ser Met 


Phe 


Val 


1105 








1110 








1115 






1120 


Arg Pro 


His 


His 


Lys 


Gin 


Arg 


Phe 


Ser 


Gin 


Thr 


Cys 


Thr Asp 


Leu 


Thr 








1125 








1130 






1135 


Gly Arg 


Pro 


Tyr 


Pro Gly Met Glu 


Pro 


Pro Gly 


Pro 


Gin Glu 


Glu 


Arg 






1140 








1145 






1150 




Leu Ala 


Val 


Pro 


Pro 


Val 


Leu 


Thr 


His 


Arg 


Ala 


Pro 


Gin Pro Gly 


Pro 




1155 








1160 








1165 






Ser Arg 


Ser 


Glu 


Lys 


Thr Gly Lys 


Thr 


Gin 


Ser 


Lys 


He Ser 


Ser 


Phe 


1170 








1175 








1180 






Leu Arg 


Gin 


Arg 


Pro Ala Gly Thr Val 


Gly Ala Gly Gly Glu Asp Ala 


1185 








1190 








1195 






1200 


Gly Pro 


Ser 


Gin 


Ser 


Ser Gly Pro 


Pro 


His Gly 


Pro 


Ala Ala 


Ser 


Glu 



1205 1210 1215 



Trp Gly Leu 



<210> SEQ ID NO: 3 

<211> LENGTH: 114793 

<212> TYPE: DNA 

<213> ORGANISM: Homo sapien 



<400> SEQ ID NO: 3 

Cgaagagctt tgaccaagag gctgtgacga ggccctacga ggactctggc tctcctcctg 60 

ctaagcacac ccaggcaggt gtcctggcag atgaggacca catgcagagc ctcggccagc 120 

ccaccaatgc ccggatatgc aagtgagccc agcctggacc ccccggcgag gcccagcagc 180 

accagcccag gcccgaaaac cttaagaaat gaccagtgtc tgctgcttta agccaccaag 240 

ctctgcggtg gtttgttagg ctgcaagcat ggctaattca gaaactgcca gaaacaagca 300 

ctgctgtccc cagcctggga cacacagcac cgcctctgcg tggggagagg gcacaggcta 360 

agggcacaaa tgccatccca gacccggctc ttgtgtgtgg aaggggccac tgtgccatga 420 

ggcagaggaa accttggcag gaccttatgc cacagcaatc taaaagagaa gaaacaggct 480 

gggcgtggtg gctcatgcct ataatcccag cactttggga ggccaaggtg gtggaCcact 540 

tgaggtcagg agttcaagac cagcctggcc aatatggtga aaccctgtct ctacgaaaaa 600 

tacaaaattt aggcaggcgt ggtggcgggt gcctgtaatc cctgctattc aggaggctga 660 

ggcaagagat ttacttgaac ccaggaggtg gaggctgctg cagtgagctg agatcatgcc 720 

actgcactcc agcctgtgtg acggagtgag acttggtctc aaaaaaaaaa aaggaaacac 780 
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atctgactag Cgtgatctcg caaggaacat tccagacaca gtggagctag aaggttcttc 840 

tccaaacaag gaatccccag gggatcaaat tgttttgcat cggccagaca tggtggctca 900 

agcctgtaac cccagtgctt cgggaggctg aggtgggagg actgcttgag tccaggagtt 960 

caagactagc ttgggcaaca cagtgagagc ccattagcca ggcgtggtgg cacatgcctg 1020 

cagtcccagc actgtactaa aaatctacac ggggccgggc atggtggcac atgcctgtag 1080 

agtcccagct acccaggagg ctgaggcagg acgattcctc gaacccagga ggtcacggct 1140 

gccacgagcc gtgactgtgc cactgcactc cagtctgtgc aacagaacga gactctgctt 1200 

cgaaaaacaa aaaatcattt catgtctcca gtttctccac tggcaaaaga ctctgtcaag 1260 

gtaaaaaatg gttctgaccc acagaaatct aagaaaggaa aaaatataaa aaatagaaaa 1320 

tttaaaaaag agatggtctc agaataaaga ccaacctggg ctatggttgt cactcctccc 1380 

tcacacctta gaaagctttc tggccgcatc tggccaaagg gccaccccgc cccatcttgg 1440 

atcagtgagg tgccttcgaa caagccacct gcccCggagc ccgtcctgtc ttgCctgcca 1500 

ccgcacgctc agtaggggag gggaagtcgc taggttttag ttcaccagtc tctggatcaa 1560 

gacgtgccat aaccaagaag ccccagccac acccagaccc gatgtggcca caaggggtga 1620 

gctgggaagg cccaggaaaa ggcgggaggc ggacgaatgg aaatgtcatt ctgtggccac 1680 

agaaatgatc tcaacgtttt gtaacttcct accaagaggc agtcttagct ctgcccttga 1740 

accagcactt ggtgatgtcg cttgcgtcaa tcaaggcaac agaagtgagc aggaggccca 1800 

ctttcctctg caactgtggg cttacggggc aaagaagtcc aggcctccag gtggaggatc 1860 

acagaccggg caaagcagag gagagccacc cagccgagcc tacctgtgcc tcagactgcc 1920 

tccctccaga gacccctgtg gccaaggcca cccagaccag caggtccttg ccaagctgtc 1980 

agctgacgac aggggttggt gaggccggcc cagaccagca gaaccacgaa ccaaccaaca 2040 

gaatcaaaaa taataacaac tatgtcttgt ctcaagccac taagttctgg atggtttctt .2100 

tctttctttc tcttcttttt ttccggagac gcagtctcac tctgttgccc aggctggagt 2160 

gcagtggcgc aatcttggct cactgcaagc tctgcccccc ggactcacgc cattcccctg 2220 

cctcagcctc ctgagtaact gggacCacag gtgcctgcca ttgggtgttc tcttaaacag 2280 

caaaagaaaa ctgacacaat cataaacaga gcaagcaaga gaactcggca attatttcct 2340 

ctctacttct cactgttctt caaagagtta actcaagcat aagatgtgag caaattcttt 2400 

taacatccta gaaaaaaagc tcctactcag tgttcataaa gcaaagctaa cctacaggag 2460 

ccaccttcca cagtgaccac aggaaaccaa gacagcaagt gggacaccag cctccagggc 2520 

actgcgccag ccgtgcgcct gtgtctgcca ctgccctggt ccgtcactgc caccagccgg 2580 

caagacaccc acagaggaga gcCctaagcc acaactgtgt acgaagacaa ctgtgcagga 2 640 

ttttattact acaacatttt tgttttcttt tttttttttt tttgagactg agtctcgctc 2700 

tgtcacccag gctggagtgc agtggcacaa tctcggctca ctgtaacctc caCcCccctg 2760 

gttcaagcaa ctctcctgct gcagcctccc aactggatta caggcgcccg ccaccacgcc 2820 

tggctaattt ttgtactttt agtagagatg gggtttcacc atgttggcca gactggtctc 2880 

aaattcctga caagtgatcc acccaccctg gcctcccaaa gtgctgggat tacaggtgtg 2940 

agccactgcg cctggcccat ttttgcttat caataaaaat gtacccaatg ttgaactccc 3 000 

cacatttcaa aCgggtaact ccagtgtcct tgatgctcct gcgacatgtt cgCgagactt 3060 

ctcttgggtg tgagagtcca gcatgtgggt ggtctggaca ggagggggag ggaagagtgc 3120 

agagccgggc agggtaaaga gaccccctag gatgtgaagg ccgccctgca tttgtcagac 3180 

tgggcaacac ccactccatc agatggaccc tggtatgggc ggcaagccac ctaggtgccg. 3240 

aggcaagaga ccgagggcac gagctgttcc ggtgtaataa aatgcataaa ataagaatag 3300 

ttatactaga tatagatcat aaatatgatt atatatgaat atcatteatc attagtttgt 3360 

agcaattact ctttattcca atattataat aatccttgcc taagcataac ctaggaaaaa 3420 

ctaggaaatc ataacctagg aaaaactagg ccatacagag ataggagctg aggggacata 3480 

gtgagaactg accagaagac aagagtgcga gccttctgtt atgcctggac agggccacca 3540 

gagggctcct tggtctagcg gtaacgccag catctgggaa gacgcccgtt gccaagtgga 3600 

ccgtggtcta gcggtagcct cagtgtcaag gaaaaacacc cgctacttag caaaccagga 3660 

aagagagtct ccctttcccc gggggagttt agagaagact ctactcctcc acctcttgcg 3720 

gagggcctga catcagCcag gcccgcccgc agttatccgg aggcctaacc gtctccctgc 3780 

gatgctgtgc ttcagtggtc acgctcctag tccgccttca tgttccatcc tgtgcacctg 3840 

gctctgcctt ctagatagca gcagcaaatt agtgaaagta ctgaaagtct ctgataagca 3900 

gaaataatgg cgtaagcggt ctctctctct ctctcctctc tctctgcctc agctgccagg 3960 

aagggaaggg ccccctggcc agtgggcacg tgacccacat gaccttacct atcactggac ■ 4020 

atggttcaca ctccttaccc tgccgct.ttg tcttgtatcc aataaatagc gcaacctggc 4080 

attcggggcc gctaccagtc tccgcgtctt ggtggtagtg gtcccccagg cccagctgtc 4140 

tttttctttt atctttgtct tgtgtcttta tttctacact ctctcatctc cgcatacgag 4200 

gagaaaaccc accaaccctg tggggctggt ccetacaccc tggctttgta gactggagcc 4260 

taggcacgac tcagctgctg tagtgaattg cgatcctcca aacccagcaa ggcacctgca 4320 

ggacatctgg cccagtctcc tcgttgagcc agttcacgaa aaagagactc ttctg.agtga 4380 

catgctaatg ggcaatatga ggactaaatg ggatggtctc caacctggac aaaccaacag 4440 
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taaaagccac tttgcgggga aagaaacttt tccttttttc ttttttttga gacaggatct 4500 

caccctgtca cccaggctgc agtgcagtgg catgaccttg gctcactgca gccCcaacct 4560 

ccctcaggct caagcaatcc tcccgcctca acctcccatg cagctgggac cataggtgca 4620 

tgccaccaca cccaaataat ttttatattt tCCgtagaga cgaggtttca ctatgttgct 4680 

cgggctggtc tcaactcctg ggctcaagca accctcccac ctcagcctcc caaagcgctc 4740 

agactacagg caggagccac caggcctggc caacatagga agaaatttaa atttgaattg 4800 

aatattagaa gagatgaaaa ttcatcaaca tggaaagaca aagatcatta actaaagcca 4860 

aaccagaatg gaagctgtgt gtacagtggg gtcccatgct gggaacgcga ggggcacgtg 4920 

cagggctcca cggtgtggcg acgccccatg ctccctttgt gggggtccat ccagcggaac 4980 

atgaggacct ggggtgcttt ccaacatgta cgtgagttta ataataaaaa ggtttaagga 5040 

aagaaaaatt catatgtttc tatataaaca gaacatctgg aaagatctat tctaaggtgt 5100 

tgacagtagg aatctcCagg tagtagtaac atggcctttt tgaatttttg ctCatcagta 5160 

ttttctaatt ttctttttct ttctaaataa ttctagctat gaaataattt tctaccatac 5220 

atactttgta ataaaaatgg ttatatttaa ttttttaaag gctgtacaaa cttcctgata 5280 

aaatggcaaa ttagacacac acatgtgggc cgggtacagt ggctcgcgcc tgtaattcca 5340 

gcactttggg aggctgaggc aggcagatca cctaaggtca ggagtttgag accagcctgg 5400 

Gcaacatggt gaaaccccgt ctctactaaa tatacaaaaa tgagctggat gtggtggcac 5460 

acacctatag tgccagctac ttgggaagct gaggcaggaa aattgcttca acccgggagg 5520 

cagaggttgt agtgagccga gatcatgcca ccgcactcca gcctaggcaa caagagcgag 5580 

actccaactc aaaaaaaaat aaaaataaca cacacgtgaa taggctcctc atggaagtca 5640 

tcacaacaaC gcagagggaa gagcttccaa agtgtaaacc cagaagcgag gagcaggagg 5700 

gtgcgcgcag acgcagagag cagcaaggtg cagactgaga ggcggaggct ggccgtgggg 5760 

agatgactga tgctcag;ttt acaccccaaa tccgtaaatc tagaggcctg gcacatcaac 5820 

tacccctgcc agcaggaatg agggaaagga gggcaaccaa aagacgtccc accctcaccc 5880 

atccagctac ctgccatcct cagccccact ggcagaagac cctgagaggt ggaggcaggc 5940 

ccctgcctac aggaccctga gagctagggg aaggcgttat cctgaactgt gtcccccgta 6000 

aaattcatat gttgaaggcc tcatccccag tgCgactgta tttaaagatg gggtcttcag 6060 

gagataattt aaatgaggtc atataagttg gccctcatcc agtaagactt tgaccttctg 6120 

gtggtttttt tttttttgga gactgggtct cactctatca ctcaggttgg agtacagtgg 6180 

cacgatcacg gctcaccg.ct gtctccaact cctgggctca ggtgatcccc ctgcctcagc 5240 

ctcctgagta gctgggacta caggCgctta ccaccgcacc cagctggtgg tgcattgtgt 6300 

tttttgtaga gatggggttt tgccatgtcg cccaggctgg tcctgaactg ggctcaagtg 63 60 

atctgtctcc ctcggcctcc tgcagtgctg gaattacagg tatgagccac cgcgcctggc 6420 

cgaccgtgac cttctaagaa gtgaaagaga aagatctttc tctctccctc cctctccatc 6480 

atgaggacac agcaagaagt cggccatctg caaggtagaa agcgagtcct cccaacagct 6540 

gaacctggca gaccctgatc ttggacttca gccttcagag ctgtaagaaa ataactctct 6600 

gctgttcagg ccacgcggtc tacggcagcc cgagcagact aagacacacg ccatctgggg 6660 

agtcagacca gatcaggaag aaaggcctag agctcaggat actgaaggtc ccaacccggt 6720 

gctggaccag accaccccgg cagccgcggc cacggagtca cggctcgggt gaggtgacct 6780 

ggacaccatc ccggcagccg cggccacgga gtcacggctc gggtgaggtg acctggacac 6840 

catcccggca gccgcggcca cggtgtcacg gctcggatga gatgactcgg acaccacccc 6900 

ggcagccgcg gccacggtgc cagggctcag gtgaggagag ctggacatgg gactgggcct 6960 

accccgaggc tgcttccacc cagacgcctg ggtgggtgac acgaaagctg ggctcagttg 7020 

ggatcagagc agcctctccc caggtcagaa atgaccctgg gctcctcaca gtagccctag 7080 

ggcaccatga gaaagctacg tggacttctc tgaccaaggg tcactgctgc cacactactc 7140 

attgcaggcc atgtcagggc tcagctgagg agacgtggac accaccccag cagccgcggc 7200 

cacggcgtcc caagggaggg acttgggcac tgcctctctg ggcaagagtg gggaggtgtg 72 60 

gggtgggaga tgtctggaaa catcatggac acatgccggg aaaacacgga agctgtgcac 7320 

caaggtgctg acaaaggaaa aaggagaatg gaggtgtgaa catccagcta gcaggtccca 7380 

ctcagaaact cctgcatttc cagacatggc caccagctct gtggatgaga caggggagga 7440 

cagggtacct cacaccagga acccacacag gtccatgtct tgctctgtga tcacacaaca 7500 

gcctccacca ccctgacacg caggagggag gtcaaagcct cgggtccaac aacaggctcc 7560 

acagcaaggg aagaaaggca ggaaggaact cagggccagg tcctcccagg cagcagctgc 7620 

ctgcacgctg tccaccaagg gaggcctgac ctacaccgca caggggttgg cagtctagag ■ 7680 

tcgtcctctg tcaaacggtg agaaagtcaa aagctcatgc tcagtgaCat gctaggtcag 7740 

catgaagatg ccacacatga gacacagcaa ggatgagacc aacgggaaga ctgccccaga 7800 

ccagagcccc agagccctct ggggaggaag aataaggatg gcagcctggg actgcccggg 7860 

gctgactctg cctttatttc accccagcag aggcaggagt gacaccggct cacagcagga 7920 

gcagctctgc cacctcctag cagttccacc tacgggcagc aaaacaaagc tggcagtttg 7980 

ggcaaatgtt agcgtttttg ccaactaaca tttgaatcgg acatctggta cagagacgag 8040 

gaagaaaaca ctcacagttt catgaagact gtcaagaaaa tcaccgactc ttcacttcat 8100 
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ttatgaaagg ccagctctcc gacatcccta ccactccctc tcacatgaga aatcacggcc 8160 

tttcaggacg tggagccacg tggccaCgca ggtacgggag gcctccccgc agctgcagct 8220 

gggtcttctg gtccccgtgc catttctgct tttcCtcgct ctctacttac acacacattt 8280 

gagtccagtc tcagaagaac tggaactaga aaaatcctga cacttgtccc ttactacgtt 8340 

aatgccagct gtgccaagga cagcccaacc caagccccca tcagccccaa tggcaccgag 8400 

gcccgagctt acccgtgagg ggccaagctg gtcgtcacca acacggtctt caccccGtcc 8460 

acaccactgc cgtccactgc agtgtccgga gttgtcacaa ccaccacctc ctccatgtgc 8520 

acactcacgt cgggagtcgc catggctcag cggaagggga cgcccaggcc agcagcgtca 8580 

gtcctccagg g.tcccaagtc ctggaggaag caaggcaggg cacagggatg gagtcatctc 8540 

cacatccaca caacatagca ctcacaaagg catctctaat cagctccaaa gacccaccct 8700 

tgagtcccag actgctacct cctgacaaaa acgagcggca acagaagggc tactccaggc 8760 

tctggttccg agggcggtgt aagcgcactc cacccgtttt tcccactgga taagccgaaa 8820 

cccttgggta gaaagcacag agccactccc tccacgtggg gctcagagca ggaggacagg 8880 

aggggcctgg aattccaagc aacttccctg gacgcaggct cccggcttgc cagttcttcc 8940 

gtctctcctg gcctgaactc aaagccagcc ccaatccctg aactgagttt caggtgcaga 9000 

aagcactcca agaagtcctc gctggtctgt ggaacgggaa gggaaaccca ttcaagacag 9060 

aaagagagga gggaaacgce ctgggttttt ttgggttttt gggttttttt tgagacggag 9120 

tctcgctctg tcgcccaggc tggaacgcag tggcacgacc tcggctcact gcaagctcca .9180 

cctcctgggt tcaagtgatt ctcctgcctc agcctctcca attgctggga ttacaggttt 9240 

caccatgttg cccaggctgg tctcaaactc ctgacctcag gtgatccact cacctcggcc 9300 

tcccaaagtg ctgggattgc aggtgtgagg caccatgcct ggcctgcccc gggtttaaaa 93 60 

actattatta ttttgtcttt cctggctttg ccttcagcaa gtccaacccc tgctaaaacc 9420 

cggtgataat ggctgtcctg gcccaaaaag cttggagaca ggggaatctt cctcctgact 9480 

aaaggaatgg tggcccaaga gtgtgggggc tccctgttgc cctctcactc tccatcccct 9540 

acctagcaca gggaacacaa aagcccctgg tctccagcca gagggcaacg agcctggagt 9600 

cagagtgtgg gggaggcgac aagaggagag gggagaagag aggatggcac acagctgtgt 9560 

gtgagcgcct gggtcgtccc aagacagtct ctacgtggtc ctgaccctaa agggcaaagg 9720 

gaagaaaact gacctacagg ataggccact gcccaggtct cagatgggcc ccagtggcgc 9780 

atatgggaca gatccacagt gcactggaaa gtctctaaaa taaactggcc taagaacaca 9840 

gacacaggaa cggggtgcaa aatttgcagc ctgaacctaa ccaggtcgat CCcttgctat 9900 

gaaaaaaaaa agtctacatt ctctgtgaaa cttaaaacaa gacctagagt ccatagcaca 9960 

gtagtcaaag catccagaac acgatcaaac ttcctggcaa agggtagtct ggttgattct 10020 

caaaggaaca aatacacaag agaagctggc tcttgaacgc agaatccaga gactttcagg 10080 

tgctatcgga ccagctccaa gaggaaagca aacattgCca accaagtgga aagaaaatct 10140 

tggtatagaa acaggagtta taaccaaaca gaaatgtgaa aattaaaaac gacaaccaaa 10200 

agaaaataca caaagctggg atagtctcag ctacteggaa ggcggggctg gaggatcgtt 102 60 

tgagcctagg agattgaggc tgcaatgagc tgtgatcaca ccaccgcact ccagtctggg 10320 

caacagagtg agaactctct caaaaaacga aaaagaaaga aagtagaaca gaagtgacca 10380 

ggggctgggg gagggagtac agggagttgt tccttaatga gtacagaatt tctgcttggg 10440 

atgatgaaaa gctctggaaa tggacggcgg tgatggctgc acaatcactg tggctgttct 10500 

gaatggtgct gaaccacaca tttaaaaaca gttaaaatgg gctgggcgtg gtggctcacg 10560 

cctgtaatcc cagcacttcg ggaggcggat cgcctgaggt caggagttcg agaccatcct 10620 

ggccaacaca gtgaaatcct gtcttgacta aaaatactaa aaattagcca ggcatggtgg 10680 

caggcacctg tagtcccagc tacttgggag gctggggcag gagacctgct tgaacccagg 10740 

aggcagaggt tgcagtgagc cgagatcgtg ccactgcact ccagcctggg caacaagagc 10800 

gaaactccat ctcaaaaaaa aaaaaaaaaa aaaaaaaaaa aagtttaaaa tggttaaatt 10860 

ttatgttatg tatattttac cgtaataaaa acactgtaat gctactataa cagaatgact 10920 

cattaggatt agaCatagac tagaaagtac agaatataaa aactttttaa acaaagaaaa 10980 

attttcatgg ccaggcacgg tgtcacacct gtaatcccag gactttggga ggccaaggca 11040 

agaggaatgc ttgagctcag gggtttgaga ccagcctggg caacacagca acaccccatc 11100 

tctgctaaat aaataataaa aaatagccag gcatggtggt gtgcacgcct gtagttgcag 11160 

ctactctgga ggctgaggca ggaggatcac ttaagcccag gaggtcaagg ctgcagtgag 11220 

ccatggttgt gccactgcgc tccagcctgg gcaacagatc aagaccttgt cacaaaaaaa 11280 

agaaagaaag aaaagaaaaa agaaagaaaa taaaatcttc cagaactttt aaaatcatca 11340 

ttgttaatat aaaaataaca tcacctgccc ctaggactgt aacaaacaag tgtgtctaag 11400 

gacaggagtg ggtccacccc aacctggcac gcagtggtcc cctgcggaga gtctggccct 11460 

gcactcacta agaggaggca ctcatagccc agccaggcct ctgcaattat gccttcaatg 11520 

ccagaactaa ctcacccaaa ctgaacaatc gatcacaaaa tgtgccttca ggtctcaagg 11580 

ttcttgctaa atcttactca accgacattt tccagcatgg gaacattttt ctgaatgtct 11640 

tagggagagg aagtccgcaa gagaacaaaa ggtcctcagg ccaccctagc ttcttttccc 11700 

ccattccaca ggctgtcttt tgtctgggta tgcactggac cagggggctc tacttcttcc 11760 
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tacctgggca tgggtctcca cacaactcca aggtaaaggg ccacaggcaa gataaagggg 11820 

agaaaagaaa gctacgattt cctgggccac caatcgcaaa tggcagccag tctctgaagt 11880 

aacccttgac cagagatcca aggaaccaag aaatgtaggc gacctgaaca gaggggatgg 11940 

tggttaaaca ccatgaagga aagacccatt ctcaaagaaa aggaagcaaa aagaaaccgt 12000 

ggggagctgg gtaccacccg cagcaaagac cccgcacgcg Ctactgacgc cagcctggcc 12060 

tgggagagca gcgagtgtgg cggacggtga gtggcgggga gggctgtggt aggtttaggg 12120 

taagaagggg cagcgcccag agcccagaga acaccagtga gggctccaca ggaacactac 12180 

tcaaagtatt cacggaacac atctaaacac aagcactaag gactaagtgc gagggacaag 12240 

aaaatattcc ccgtttcctg tttcaggagg gtatcgaaaa tgagtgatgg aaggaaaatg 12300 

tattgtttaa atgaggaaaa aaaattttta caaactaaga acatcctgga acatgatgag 12360 

ccgtttactg tcactcaatt taaatggtgg ccatctagga cagagcgcct aaggggaaag 12420 

ggggctcaca ggtgaacccc tccagccgct ggtgggcaat ttcccatcag ggcatcaggg 12480 

tctctgaaga ctgtcttcag atgcttttta gccaggaaag ttacaatgat gaattcgttt 12540 

acactggcgg aattacttcg tatttctcaa atataatgtc ttcactagca taactttgtt 12600 

gttgtagact taggcttcaa aataaagaac tttaaacaaa cacgaataaa aagccacttt 12660 

aggccgggcg cggtggctca cacttgtaat cccagcaccc tgggaggccg cggcgggcgg 12720 

atcataaggt cagaagttca aagaccagcc tgatcaatac ggtgaaaccc cgtctctact 12780 

aaaaatacaa aaattagccg ggcgcggtgg caggtgcctg taatctcagc tacttgggag 12840 

gctgaggcag gagaatcgct tgaacctggg cagcagaggt tgcagtgagc caagatcatg 12900 

ccactgcact caagcctggg tgacagagtg agactctctc ttaaaaaaaa aaagccactt 12960 

taaaatttta ctcaggccag gtgtggtggc tcacgcccat aatcctagca ctttgggagg 13020 

ccgaggcgag cagatcaccc gaggtcagga gttagaccag cctggccaac atggcaaaac 13080 

cttgtctcta ctgaaaacac aaaaattagc tgggcgtggt ggtgtgccca tgtaatccca 13140 

gctactcagg aggctgaagt gagagaactg cttgaacccg ggaggcagag gctgcagtgt 13200 

gccaagactg caccactaca cttcagcctg ggcgacagag caagaccctg tctcagaaaa 13260 

aaaaaaaatt caaaaatttg gccaggcgtg gtggctcacg cctgtaatcc catcactttg 13320 

gaaggccgag gcgggtggat cacctgaggt caggaattca agaccagcct ggccaccatg 13380 

atgaaaccct gtctctacta aaaatacaaa aaaaaaaaaa caaattggcc gggcatggtg 13440 

gcgggtgcct gtaatcccac ctacttggga ggctgaggca ggagaatctc tcgaactccg 13500 

gaggcagagg ttgcagcgag ccaagattgt gccactgcac tccagcctag acaacagagc 13560 

gagactctgt ctcaaaaaaa aaaaaattaa aattaaaaaa taaaaatttc atttaaaata 13620 

ctactgatct cccgtgctga cttctcgggg tttaactctc actgaggaga cgctgctttc 13680 

ataagggtaa gctcagcagg ggcaactaaa gtcatttaag cagagagctg caaagaggca 13740 

acagcctcac tgcaggcagg ggtcctcgtc acagcttcag ggctttgcag aggattacgc 13800 

aatgtacacg cacaaaactg aattccagcc tctccattgg caactgcata catacatata 13860 

tccttttttc gagacggagt ctcgctctgt agcccaggtt ggactgcagt ggcccgacct 13920 

cggctcaatg caagctctgc ctcccgggtt caagcgattc tcttgcctca gcctcctgag 13980 

tagctgggat tacaggcgcc caccaccacg cccggctaat ttttgtattt ctagtagaga 14040 

cggggtttca ccatgttggc caggacagtc tcgatctcct gacctcgtga cccgcccgcc 14100 

tctgcctccc aaagtgctgg gattacaggc gtgagccact gagcctggcc tccaatggca 14160 

actatattaa aggttcaaag caatatgcac aaaagttacc tcacagaaaa tagtgcaagt 14220 

ccttgataca atgctcttta gacacagaag aagcactata gaatagagca cctcgcccta 14280 

ttgccttccc aagggcgagc accccctcct ccctccacag ctccttcttt gtttttttga 14340 

gatggagtct cgctctgtca cccaggctgg agtgcaatgg caaaaccctg gctcactgca 14400 

acctccgcct cccgggttga agtgattccc ctgcctcagc ctcccgagta gctgggacta 14460 

caggcaccca acacgcctag ctaatttttg catttttggt agagacgggg tttcatcatg 14520 

ttggccaggc tggtctcgaa ctcctgacct ccagtgatcc tcccaccttg acctcccata 14580 

gCgctgggat tataggtgtg agccactaca cctggcctct ccacagcccc ttctgtgttg 14640 

aagccaagac ccacccagct ttgatcccaa ggcttgggtt ccccactagt gtgaagcgag 14700 

tttccaaatt attaggtaaa tcagatatga gaaaatattt tattttactt tttttttttt 14760 

gagacgcaat cttgctccgt cacccaggct ggagtgcaat ggcaccatct ccactcacCg 14820 

caacctctgc cttctgggtt caagcaattc tcctgcctca gcctcccaac tagctgggat 14880 

tacaagtgca caccaccacg cccggctaac ttttgtattt ttagtagaga cagggtttca 14940 

ccgtgttagc caggctgctc tcaaactcct gacctcatga tccgcccacg tcgggctccc 15000 

aaagtggtgg gattacaggt gtgagccatc acacctggcc caagaaaata tttttaaact 15060 

agtattcttg accggcacgg tcaacactga tgtaattgaa actgttgtat ttgaagtgtt 15120 

agcaaagaaa gagaattctg gttcaacaga aaagtcagtc acgacttttc agtcacgcat 15180 

gaattacaca gtaaccaaat agataacatg ccatgactga cgacgggccc acaacaaatc 15240 

agctccgacc aacagggtcc acaccaccat gggtctacac agatccaggt cccgcctgtg 15300 

agcctacagt gacgcgggcc cctgtggggt ggtccctgca ggtcaggtcc ctgagagtgg 15360 

gtcccagtgg ggtgatccct gcgggtcgcg tccctgcgag ttgggtgcct gccgggtggc 15420 
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ccctgcgggt cgggtgcctg cggggtggtc cctatgggtc gcgtccctgc gggtcgggtg 15480 

cctgcggggt ggcccctggg aatcgcgtcc ctgcgggtcg ggtgcctgcg gggtggcccc 15540 

Cggggatcgc gtccctgcgg gtcgggtgcc tgcggggtgg cccctgggga tcgcgtccct 15600 

gcgggtcggg tgcctgcggg gtggtccttg tgggtcgcgt ccctgtgggg tggtccctgt 15660 

gggtcgcgtc cctgtggggt ggcccctgcg ggccgcgtgg tggcccctgc gggtcgggtg 15720 

cctgcggggt ggtccctgtg ggtcgcgtcc ctgcgggtcg ggtgcctgcg gggtggtccc 15780 

tgcgggtcgc acccctgcgg cgtggtcccc ccgggatggg tccaccgagg aggccgctgg 15840 

aggccgagcc cgcgcccgcc cgcggcgcca agatggaggc aggaagcgcc gccgcccgcg 15900 

cccgccaccg cccgcgccgc ccgcctgacg ccgccgttgc gcctgacgcc gccgcccgcg 15960 

cggccgcccc tcccccggcc ctcccctccc cccgccgtaa cgtcctgacg ctccgcaggg 16020 

acccctgact ggacggcggc gcgtgagcgg agcgagaggc ctcgccgcgg gggggccgcg 16080 

ggctcgccgg cgccgcttac ctggggccgc gccgggcctg cttaggcacc cggcgggggc 16140 

ggcggcgtcg ggagctgcgg cggcggcggg cggcggcggc ggccgcgggc ttcgctcctt 16200 

gttggggatt cggcggcggc ggcggcgcgg gcgcgcgctt cctagtgacg caggcggcgg 16260 

ggccgcgcac gcacggggct gggagggccg gacacttatt tggcgctcgc ggaggaggaa 16320 

ggcggggccg tgaaataagg cccgacgggc cccggggcgc gtgcgcggac cgacactgtc 16380 

agctcctaac gccgcaggtt cctcctggtc cccgaggccc ccggtcgggc gttgcctgcc 16440 

ccgcgcgggc ggccgggccg agggacgatg gtcagtggac ggacggcgcc agggagcagt 16500 

gcccacgcgc ggcagggcgg taccttcagg cctccaggta cgggcgctcc tcgcccggac 16560 

gctgctgtgt gtgaatgggc gcgaggggac tcccctgcgg ggcggacgcc tgaacacgag 16620 

gctgtggagg aggacgctgt agggtgcgcg gactcacgcg gaacatgcca gaggctcagc 16680 

cagccacggc gctcccagcg tggagggcga ggggcatccg ggagcggccg ggagggctcg 16740 

gtcacccctc aagctgtcac cccagtccca caaccagcac cccgatccta tcgcagtccc 16800 

acagccgaca ccccgatccc acccctgccc aacagccggc acccacccca atcccatagc 16860 

taacaccccg gtcccaccgc tgtcccacgg ccggcacccc gatcccaccc cagtcccgca 1692 0 

gctggcaccc cgatcccacc ccagcccaac agctggcacc caccccgatc ccaccgctgt 16980 

cccacagccg gcaccccgat cccaccccag tcccgcagcc ggcaccccga tcccacagcc 17040 

ggcactcacc ccgatcgcat agcatagctg ataccccgat cccaccccag tcccatagcc 17100 

agcaccccga tcccacccca gtcccatagc cagcacctcg atcccataga tgacaccccg 17160 

atcacgcccc agtcctatag cccgcacccc gatcccaccc gagtcccgca gccggcaccc 17220 

catcccaccc atgtcccaca gtcggcaccc cgatcccact cggatccggc agccagcttg 17280 

gatcctgtgg ccctcctcca gcccccaggg ctcatttata tgttttattg gcagaggctg 17340 

gggctggctc tgttggcctc tgtgctgggt tccttcctct gcaccgcagg actggctctc 17400 

ctgacctctc caggtgtcat cgaacaccct tgtgcttgct gtcacccgct gcctgtctgc 17460 

aggatcccgg attccgtatc aggggaccga aattagtcgg aaaataggaa gcaggtgctc 17520 

gcttggatgg aaccctgacc ctgtgctcac acttgtagga ggagggctct gcaggccgcc 17580 

tcccggaacg ggaggttccc aagccactgc acttcggagg ggctgtaatt agagttgcac 17 640 

attcattcag ttcccagtaa agtagaacgt gctccagcca gtgaggaaaa ggtgttttta 17700 

aaaattagat tggccgagtg cggtggctca tgccttttac ctcaacactt tgggagacaa 177 60 

aggtgggagg atcacctgtg gccaggagtt caagaccagc ctgggcaaca gagcctgtct 17820 

ctggggaaga ataaaaaaaa aaattgagcc tttgtcagtg ctactatttt attatctggt 17880 

aaatatgaga gggttcacgc ggtctatgtg cgtcatttat ctgagtttgc ctatcgtcac 17940 

gttttggaaa taaatgtcaa taaagtcgaa gaggagtgct gaggggggcc tggggatggg 18000 

agggtggcta catcatgcct gtgtgttgcg caagcccacc gaggtcggcc tggggtgagc 18060 

cctggggcct gttctgcctc cttcactctg gggctccaag agacaaactg ggcaacaaga 18120 

gagaaactcc atctaaaaaa aaagaaaaat cacctccaag ataacttagc tttcttctgc 18180 

tggcataaca aattatctca aacttagtcg cttaaaaatg caaatttagg ctgagtgcgg 18240 

aggctcacgc ccataatcct agcactttgg gaggccaagg caggattgct tgaggccagg 18300 

agttcgagac caacatggcc agaactgtct ctttttaaaa aatgcaaatg tgtccggcac 18360 

ggtggctcac gcctataatc ccagcacttt gtgaggccaa ggcgggcaga tcacgaggtc 18420 

aggagataga gaccatcctg gctaacactg. tgaaaccccc tctctactaa aaatacaaaa 18480 

aattagcctg gcgtggtggc aggcgcctgt agtcccagct actcgggagg ctgaggcagg 18540 

agaatggcgt gaacccagga agcggagctt gcagtgagcc gagatggcgc cactgcactc 18600 

cagcctaggc aacagagcaa gactccgtct caaaaaataa ataaataaaa ctgcaaatgt 18660 

attctctaac tgttctgtag gtcggaagtc cagcccagcc tcactccgcc aaaatcaggg 1872 0 

tgtctgcagg gccgattgct tttggagctc caggggagaa gctgttctgg cctttccagt 18780 

ttctggaagc acttgagccc cttgtctcgt ggcctatccc acacctgaaa gccagccaaa 18840 

gccagttgag ccctcaccct gttggccccg acactgatct cctgcctccc tcatctgctg 18900 

tcaaggcccc ttgtgatgac atggggccac cagctggccc agggcacctc ctgtcagagt 18960 

ccgccgacca gtgaccttca ttccatctgt cgctgtaatt cccctttgct tggaaccaac 19020 

gttcacagat cccaggggtt aggatgtgaa tatcttgggc agggctgtgg gggggctatt 19080 
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cCtccttcta 
agacagagtc 
aacctctgcc 
ttcgctctgt 
gcactctggg 
aacacggtga 
cgcctgtagt 
ggagcttgca 
tccgtctcaa 
ccaggctgga 
tatttttagt 
caggtaatcc 
tctggccata 
ctcactctgt 
ctcccaggtt 
cgctaccaag 
gaggctggtc 
gattataggc 
gggttggctg 
gaggatcgct 
ccgcaaaagc 
ggaggctgac 
tgtgccactg 
gtccaggcac 
ttcgttgagg 
aaaatacaaa 
gactgaggca 
gccactgcac 
ttgagtcctg 
gaacaatctg 
agtggctcac 
aagagatcaa 
aattagccgg 
agaatggcgt 
cagcctgggc 
tggtggctca 
cctgggaggc 
agagcgagac 
aagtcagagt 
atgtccctga 
gaacagtggc 
ggagttagag 
ttagctgggc 
atcgcttgaa 
cctgggcaac 
ttgcctgcag 
atgggcagat 
acctctacta 
agctgaggca 
gccaccacac 
aaacataggt 
cctgtcaCgt 
cagccacagt 
tagcacccat 
cagatcctcc 
tgcaagggat 
ctcgtcaccc 
gggttcaagt 
ccaggcctga 
tggtcttgaa 
ttcaggcata 



aaatatctat 
tcgctctgtc 
cccgggcaga 
ttttgagatg 
aggccgaggc 
aaccccgtcc 
cccagctact 
gtgagccgag 
aaaaaaaaaa 
gtgtagtggc 
agagacaggg 
acccgcctca 
tttattaact 
cacccaggct 
caagcgattc 
cccagctaat 
tcgaactcct 
atgagccact 
ggcatggtgg 
tgagtccagg 
taaaaagcta 
gcgggaggat 
cactcagccc 
agtggctcac 
tcaggagttc 
aaaattagcc 
ggagaatcat 
ttcagcctgg 
agttcctcta 
atgaggaaga 
gcctgtaatc 
gaccatcctg 
gcgtggtggt 
gaacccagga 
gacagagcct 
cacctgtaat 
ggaggttgta 
tctgtctcaa 
cacaaggtgt 
gttatttctc 
tataatttca 
accagcctgg 
atggtggtgc 
cctggtagga 
agagtgagac 
gccgggcaca 
cacctgaggf 
aaaatacaaa 
ggagaatctc 
tccatccagc 
gggacccttg 
aaaagaaaac 
caatacctcg 
gtcgtggaat 
catttggttt 
cattataccc 
aggctggagt 
gattttcctg 
cttatttttg 
ctcctgacct 
agccaccagg 



catttttgtt 
gcccaggttg 
cgtgagccac 
gagtctcggc 
gggcggatca 
ctactaaaaa 
cgggaggctg 
atcgcgccac 
aaaaaaaaaa 
gggattatag 
ttttaccatg 
gcctcccaaa 
acaaagggaa 
ggagtacaat 
tcctgcctca 
ttttgtattt 
gaccttgtga 
gcaaccggct 
cttatgcctg 
agtttgagac 
gccaggtgtg 
cacttgagac 
gggcaacaga 
acctgtaatc 
aaaacgagcc 
aggcatggtg 
gtaaacccag 
gcgacagagc 
agggaaattc 
agatagaaac 
tgagcacttt 
gctaacatgg 
gggtgcctgt 
ggcggagctt 
cgagactcca 
cccagctacg 
gggagctgag 
aaaaaaaaaa 
tttgattccc 
agaaacttgg 
gccttttgag 
ccaacatggt 
aacctgaaat 
ggagtttgca 
tctgtctcaa 
gtggctcacg 
caggagtgca 
agttagctgg 
ttcaaccggg 
ctgggtgaca 
gtgtgtcctt 
gagccgactc 
cttctgcagg 
cacaatgttg 
ccttatggga 
ttttctcctt 
gcaatggcgc 
gctcagcctt 
tatttttagt 
caggtgatcc 
cccagccttt 



ttggggattt 
gagtgcaatg 
tgcaccaggc 
cgggcgcggt 
cgaggtcagg 
tacaaaaaat 
aggcaggaga 
tgcactccag 
aaaaaaagag 
gtacgcgcca 
ttggtcagac 
gtgctgggat 
agatgataat 
agcgtgatct 
gcctcccaac 
ttagtagaaa 
tctgcccacc 
gaaagatggt 
taagctcagc 
cagcctggac 
gcggcacacg 
caggaggtca 
gcgagagtct 
tcagcacttt 
tggctaaatg 
acgggcgcct 
gaggctgagg 
aagactgtct 
caggcacccc 
agcttaaaca 
gggaggccga 
tgaaaccccg 
agtcccagct 
tcagtgagct 
tctcaaaaaa 
tggcaggctg 
atcgcaccac 
aaaaaacaaa 
tgtggaaact 
acatccacca 
aggccaaggt 
gaaaccccgt 
cccagctact 
gtgagctgaa 
aaaataaata 
cctgtaatcc 
agaacagcct 
gtgtgtacat 
gaggtggagg 
gagtgagact 
agggcatgat 
tgtgtctact 
gacggtggct 
tcgatagctc 
aggatcgcag 
t.tttgctttt 
gatcttggct 
ctgagtagct 
agagacaggg 
acccacctcg 
ctttcttttt 



ttttggtttg 
gtgcaatctc 
ctgtttttgt 
ggctcacgcc 
agatcgagac 
tagccgggcg 
atggcgtgaa 
cctgggcgac 
atggagtctc 
tcatgcccag 
tggtctcaaa 
tacagacgtg 
tttttttttt 
tggctcactg 
tagctgggat 
cggagtttca 
tcggcctccc 
aattttaaag 
actttggaag 
aatatagcaa 
cctgtagtcc 
aggctgaagt 
gtctccaaag 
gggaggccga 
gtgaaacccc 
gtaatctcag 
ttgcagcgag 
caaaacaaaa 
gccacccttg 
atagtctccc 
ggcgggtgga 
tctctactaa 
actcgggagg 
gagatcgcgc 
aaaaaaaaat 
aggcaggaga 
tgcactccag 
aaaacaatag 
aaaatataac 
ggtggaaaat 
ggaaggatca 
ctctagtaaa 
tgggagacct 
attgtgccac 
aataaaaaga 
caacactttg 
ggccaacatg 
gtagtctcag 
ttgcgatgag 
ccatctcaaa 
ggttgaggta 
ggagaaagca 
gccagagtgg 
tggggtcttg 
tactataata 
ctttgagaca 
caccgcaacc 
gggattacac 
tttcaccaag 
gcctcccaaa 
aaaattaatc 



gttttttttg 
agctcactgc 
tttcgtttgc 
tgtaatccca 
catcctggct 
tggtagcggg 
cccgggaggc 
agagcgagac 
actttgtcac 
ttactttttg 
ctcctgatct 
agccaccgtg 
gagatggagt 
aaacctctgc 
tacaggcgca 
ccatgttggt 
aaagtgctgg 
tagagaaact 
tccaaggcaa 
gaccccatct 
cagctactca 
gagctgttat 
gtaaaaaaag 
ggcgggcaga 
gtctctacta 
ctacttggga 
ccaagatcat 
caaaagaatc 
acaggcaaag 
ggccgggggc 
tcacaaggtc 
aaatacaaaa 
ctgaggcagg 
ctctgcactc 
tagctgggtg 
atcgcttgaa 
cctgggcaac 
tctcccaagt 
agcttaacat 
gctgagctag 
cttgaggcca 
aatacaaata 
agctgggagg 
tgcactctag 
gaaaaaagtg 
ggaggccgag 
gtgaaacccc 
ctacttggga 
ctgagatcac 
gcaaaaaaag 
tactgctggt 
ctgcatatat 
gaggctttgg 
tacaaaatgc 
catgggcttg 
gagtttcact 
tccacctcct 
atgcccgcca 
ttggtcaggc 
gtgttgggat 
tttgtttaaa 



19140 
19200 
19260 
19320 
19380 
19440 
19500 
19560 
19620 
19680 
19740 
19800 
19860 
19920 
19980 
20040 
20100 
20160 
20220 
20280 
20340 
20400 
20460 
20520 
20580 
20640 
20700 
20760 
20820 
20880 
20940 
21000 
21060 
21120 
21180 
21240 
21300 
21360 
21420 
21480 
21540 
21600 
21660 
21720 
21780 
21840 
21900 
21960 
22020 
22080 
22140 
22200 
22260 
22320 
22380 
22440 
22500 
22560 
22620 
22680 
22740 
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aatactctca ttctttattc aattgtagca ctcctagatc ccgaaagcag atacactctt 22800 

gttatgggtc tgactctttt cattgcttca cgccttagag gatattgtcc aatactggat 22860 

aaaagttcac tcaggtctac ttccacttta acggggatgg ctgaatatcC cttccacttg 22920 

gctgtttgtt Cataatgaac tgacaaacat acaaattctc ttgagttctg tgagacattc 22980 

tagtaaatca tctaacctga agagcaggtc gtgagaaccc ctgatttaga aagcccagtg 23040 

gtcataaata taagtggctc tggactggct cccggggtct gaagtgtggg cagtcggtta 23100 

ggatcgagcc cttgtaattt gtaggatctg acacacactc caggaaggca gtgtcagaat 23160 

ttacctgtat tatattggac acccagttag cgtttggaga attggttgct ggtatagaaa 23220 

aataccaaat attttatgtc aggggagtga aagaaaaaac aaaaacccgg ccgggcgcgg 23280 

tggctcacgc ctgtcatccc agcactttgg gaggccgaga cgggcggatc acgaggtcag 23340 

gagatcgaga ccatcctggc taacacggtg aaaccccatc tctactaaaa atacaaaaat 23400 

tagccgggcg Cggtggcgcg cgcctgtagt cccagctact cgggaggctg aggcaggaga 23460 

atggcgtgaa cccgggaggc ggagcttgca gtgagcccag atcgcgccac cgcactccag 23520 

cctgggcgac agagcgagac tccgtctcaa aaaaaaaaaa caaaaaaaaa aaacaaaaaa 23580 

aaaaaaccca Cacactttaa ggaaagcaac tgacagcatt tgttaccagt gataaaattt 23640 

gagctttgaa gtaagaataa caattttgcc attgtgcccg ggccaagaaa aaaaaaagaa 23700 

ttttgccatt gtgaaaggct tcccagtact ttctgatgag cttgacggtg atattaacaa 23760 

ataacttttt tttttttttt ttgagatggg gtcttgctct gtcacccagg ctggagtgca 23820 

gtggttcaat ctcagctcac tgcaacctcc gcctcccagg Ctcaagcgat tctcctgcct 23880 

caacgtccca agtcgctgga ctacaggtgt gcgccaccae gtccagataa tttttgtatt 23 940 

tttagtagag atggggtttc accatgttgc ccagactggt ctcaaactcg tgacctcagg 24000 

cgacccgccc acctcggcct cccaaaggtg ggaggccttg ctgggattag aggtatgagc 24060 

cgctgcacct ggccccctgt ccttgtgttt tgcagtgatg caatgaccaC gtcttacatt 24120 

tgcaaccaga aaaaaaggtt agtgtaacaa ' tgtctaccct gtttttccca gagtagacat 24180 

tatgaagatc aaaaaaattt gaaagtgtct tgaatataat aaactatgct atacacacaa 24240 

cattttggtg actagaaata caagtttatt gttcgttgtt tgttgagaca gggccctgct 24300 

ctgtctccca ggctgggtgg cacaaccatg gctcactaca gtcttgaact cctgggctta 24360 

agcgatcctc ccacctcagc ctccagagta gctgggactg caaacgagca ccaccacgcc 24420 

tggctaatat ttgtattttt tgtagagatg ggg.tttcacc atgttgccca gactggtctc 24480 

aaactcctgg gctcaagcaa tgctcctgcc tcggcctccc aaagtgctgg gatcacaagC 24540 

atgagccact gcacccggct gagtttctgt cgttttaagc cgcttcattt gtggtacttc 24600 

ttacagcagt cccaggaaac tgagcaactg cagaacacca aaattgtttc tcttcagcaa 24660 

aaggagaagc acttgtggtt ggcaccagct tctcctgtgc tcacctctgc atggccgcac 24720 

ctttgcccga cacgagtgca cagcaggctg tgggggagca actggctgag tcaggcctcc 24780 

acttgtgccg tatccccacc tgctttgctg gacacccctg tttggggggc acccactgct 24840 

gccccagaca ccaagcaagc accagctgtg tccaaaactt acagtcactg tcttggcccg 24900 

ttttgtgctg ctgtaacaga atgccacaga ctgggtaatt taatacagaa cagaaattta 24960 

tttcctcaaa gtcttggagg ctgggaagtc caagagcaag gggccatcag gtcagggcct 25020 

ggtctctgct tccacgatgg caccttgacc accgtgtcct cacgtggtca gagagagccc 25080 

actcccagga gcccttttaa tagagcagaa cactgctgcg ctgcggttaa gtttccaaca 25140 

cgtgaacttc ggaggtgaca cattcagacc atagcagtca ctctaggcag agtgtctgat 25200 

gtggttctaa aatacgttca cagactggcc gggcactgta gctcacgtct gtaatcccaa 25260 

cagtttggga ggccaaggtg ggtggatcac ccgaggtcag gagttcaaga ccagcctcac 25320 

caacatggtg aaaccccatc tctactaaaa atacaaaatt agccaggtgg tgcatgcctg 25380 

taatcccagc tactcgggag gccgaggctg gagaatcgct tgaatccagg aggtggaggt 25440 

tacagtgagt cgagatcatg ccattgcact ccagcctggg caacaagagc gaaactctgt 25500 

ctcaaaaaat aaaataaaat aaaatacatt cacaaggccg ggcactgtgg ctcacgcctg 25560 

taatcccagc tacttgggag actgaggcag gagaatcgct tataacctgg gaggtggagg 25620 

ttgcagtgag ctgagatcac accgctacac tctagcttgg gcaacaagag tgaaactccg 25680 

tctcaaaaaa gtaaaataag gccctgcagg catggtggcc cacacctgta atcccagcac 25740 

tttaggaggc caaggcggtc ggatcacgag gtcaggagtt cgagaccagc ctggccaaca 25800 

tgatgaaacc ccgtctctac tagcctagcc aacatgggga aaccctgtct ctactaaaaa 25860 

tacaaaaatt agccgggcat ggtggtgcgt gcctgtaatc ccagctactc aggaggctga 25920 

ggcaggagaa tcgcttgaac ccaggaagca gagggtgcag tgagccaaga ttgcgccgct 25980 

gctctctagc ctgggcgaca gagcgagact ccatctctaa ataaataaat aaaataagaa 26040 

aataaaatat gttcacaaat cctttgacat tcctcacctc aaaagctgga acccaactcc 26100 

ctcctaagca tgagtcttct cagtgactca cttctaacag cagaacttac atggttcccc 26160 

acacccagag gacattgggt tcctcccaat atccccccac ccagcgaccc ccacccaggt 26220 

cgctggcttt gggtccccca gagccatgtt tcaaggacac tcaggcagcc cctggatgtc 26280 

catgtggtaa ggaatgaagg cctcctgcct gcagcctcgg gagggagcat tctcagaaga 26340 

ggatgcccca cctcctgccc agccttcaga tggccaggac ctcgtccaac gtcctgactg 26400 
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caacatcatg agagactccg agccagaaac ccccaggttt tgtactcctg acttatggga 26460 

actgacagat aatgttcgtt gttaattaag gggcgacttg tcacacacaa taggtcacca 2 6520 

aacagctctg tctggcctcc caggaggagc ctgcctttcc tcttcttcaC gggaaaagtg 26580 

cgatcagttc gtgaaggaaC gtccgccccc acttgatgcc agaggctcca catggtgact 2 6640 

gtcataaact ccatctgccc tcagtgcctt gccagcaccc ggcctgcgat cagcttggtc 2 6700 

ttgcgggagg ccaaggccca cgtgtgttcg tgtgtggtgc ctgtgtctgc gcgcccatgc 26760 

atgcccaggg tacagggatg ccatatacaa actctttcaa tgttgtatgt ggcatgtgtg 26820 

tgtctgtatg cccaggatac agggatgcta tatacaaact ctgttttttc gttttttttt 26880 

ttttgagaca gagtcttgct gtttcgccca ggccggactg cagtggcgct atctcggctc 26940 

actgcaagct ccacctcccg ggttcacgcc atcctcctgc ctcagcctcc tgagtagctg 27000 

gaactacagg cgcccgccac cacacccggc Caactttttg tattctcagt agagacgggg 27060 

Cttcaccatg ttagccagga tggcctcgat ctcctgacct cgtgatccac ccgcctcagc 27120 

ctcccaaagt gctgggatta caggcatgag ccaccacgcc Cggcctacaa actctttctt 27180 

tttttttctt ttttttttga gatggagtct cactgtcttc caggctggag tgcagtgatg 27240 

cgatctcagc tcactgcaag ctccacctcc cgggttcatg. ccatCctcct gcctcagcct 27300 

cccaagtagc tgggactaca ggcacacacc accacgccca gctaattttt Cgtgttttta 27360 

gcagagatgg ggtttcacca tgttagccag gatggtctcg atctcctgac ctcgtgatcc 27420 

gcccgcctcg gcctcccaaa gtgctgggat tacaggcgtg agccactgcg cccagcctgc 27480 

aaactccttc aatgtctttc ttttctctct cctgccatct tctcccttgc agatttctct 27540 

tgtctctacg tcttccccag ctgagtccga ggtcctgacc Cgcccacgcc ccctggactg 27600 

gaggagaggt gatagcaaga gctccttcaa gcccaggaat gccaccaggg ctgccccggg 27660 

agaggaggaa gccgggtctc tcggggtCgt ggggaccaga cacccttcta agacatggac 27720 

tcagcacaga aagtctagac atccactaca aacacatctc cctcctaaca gggggcccct 27780 

gggcacccca agtggctgtt cggtgggaca ggcatgtcca tcagtcagaa taCctttatt 27840 

ttttattttt tattttttat ttttgagaga gtttcactgg. agtgcaacgg cacgatctca 27900 

gctccctaca acctccgcct cccaggttca agcgattctc ctgcctcagc ctgccacgta 27960 

gccgggatta caggtgtgag ccaccacacc cagctaattt ttttcttttt Cttttgagat 28020 

ggagtctcga ggctctgtcg cccaggctgg agtgcagagg cgcgatctca gctcactgaa 28080 

agctccgcct cctgggttca cgccattccc ccgcctcagc ctcccgagta gctgggatta 28140 

caggcatgag ccaccgcgcc cggccaattt tgtattttta gtagagacag ggtttcacca 28200 

tgttggtcag gctggtcttg aactcctgac ctcaggtgat ccacctccct cggcctccca 28260 

aagtgctggg attacaggcc tgagccacca cgcccagccc agaatgtctt cttacttttt 28320 

attactctgt cccccatcct gggtccagac ctgtgaccgt gaacaaccgg ctgcccaggg 28380 

gtgaatgggg tgagtggggt gagtccacag aacagtgggg tgcagcccca ggggtctcgt 28440 

agcacctgcc cccaggtcag gaagtcccac agcctagagg ctccagcctc agatgcatac 28500 

atatgtaggc cctgcccttt cctcctgagc ggcgggccac agagtcctga acaacaggaa 28560 

gcccctgagg agggctccgc cctgagggag ggcaggggag cccccgccag ccccacccac 28620 

agcagcgggc cctgccaccc cccaccctga cacctcaccc cttggattcc agagaggaaa 28680 

gtgggcttgt gtgtagttta catgctcata tcttaaaatc accgttgtca atagaacaat 28740 

tcataataat gatgataaaa taagatttat aaccagcttc agtctggaga tacacacaga 28800 

gcagatcttc actcccagac agggagcccg cagctgcccc cgaccccaca ggtgcaggac 28860 

acacacagac agttcaacca tgtcttaaac acacaggtgt ttatttaatt gttcatttga 28920 

ttgaattttt aagttcactt tactacgtgg atgagatggg cgcatattac agtaggcttt 28980 

cgctatgagc gctgccacca tgaggaatat cccagccctc agttctgctt ccctttctga 29040 

gtcccacaaa agccagatgt ggacagcctt gggttcccat cccagctggc tgctccttct 29100 

ggggctgtct tggtggggag agggagatgg ggcagtgggt ccctgctgac ccctgagccc 29160 

tgcaggggtc aggatcctcc cgtggtccct gggtgtggct ctggaagaca ctggcagtgc 29220 

ccggccaagg cctcccgcag gatggaagtt gagggccctg gctctgggtc ctaagagaac 29280 

tcagccgccc ccttcacact ttacagcaag gggccaggca gcagctttgg gatggggctt 29340 

ccgtggagaa gtgggggatg ctgcagtggt acaaagacag cctcccccac cgccatcctc 29400 

cagctgaccg tcctccaagg ccagcactgg gcgtccaagg gaaagaagga actcagccca 29460 

gagggtgtgg gcaggagagg cctggagtca ggcctccacc cacagccccc tctgggtgcc 29520 

aagtgggaag ggtgttgggg ctggcttggg aaccttaccc gctgcccttc caacacctgg 29580 

atctgtgggc agcggtccca caaaatcccc cttggggctc cctgaggagg acttgtggct 29640 

gccgcttcca ccagggcaga gggcacagga ggggccagca ctccaaaggg ctctagggtg 29700 

ggtccttcaa ggacatctgc aaagccctgg tggggagggg cctgggccag aggctctttg 29760 

gaactcttgc acttctgagt gggggactgt ccatgctgcc cacaacctct agaccatgca 29820 

gcctgctcat gggtccctgg cagagaatgc ccactcccca gcagactcag ggcaggcccc 29880 

caactgcagg cttccaggaa. ggcccagggt gtccacctca cgccaggtgg tctcagagga 29940 

cccctgtgca accacattaa ggaaagctgc agcccccacc cacccgcctg ccagttcaac 30000 

aagcaccggc tgcacacgca ggctcccagg caccatcacc cccctccccc gtcgcccctc 30060 
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cctcacgggg agccccttcc ccctggaaag acagcaggta ctgtagcctc gcctgctggc 30120 

caggggcgcc ggctcagagg acctgccctg acctgcacgt gctgaccaga cagcccagcg 30180 

taaggacccg cgatcccacg ccaccgccct gggtttacca cggtcaccac cacctctctc 30240 

acagggcccc cgggggaccc agccgcgccc ggcctggtgt ctgcaccgag ggaccgcgtc 30300 

tcacgcccgg cggctcctgc aggggaagcc gtggtcagcg acccaccacg aggacagggc 30360 

agggcggctg agtgcggaag agaagcatga agccgggggc gggggcgggg gaggaggaac 30420 

aaaagttgca tctagacaga ggtgaacgaa acaaaaccaa aacccgaacg tgttccgtcg 30480 

caggatgggc gccgcccgtc ccgggccctt agcccgacat ctcttctcgc tgctccttgt 30540 

tcctgcgcac ctcggccgcg tgcagctcct gcaggacagg gggcgggagg gcctgagggc 30600 

gggggtggct tggggcgact ccgggaaccc ccaggcgcgc aggccgtggc gccctggcac 30660 

ccgcccggcc tcatccgggc tggccttcgg caggaccctg actgagttga gggggcggga 30720 

gcaccgggga ggcgcagagc aaggccaggg accaaggacg ggtttcctgg gagctggctg 30780 

ggccccgctt ctagctcgta ccggagccga gcttccttca gggcactttc aatataatga 30840 

atttagccat ctattactgc ggctagttac tgtcccgcca ggaccagacc ctggacctgc 30900 

ctcgtgcgct gctggggacg cccagtaaac acgggaggag cccccgaccc ccaccccagc 30960 

tcagcgcctc ggagtccccg gccccgctct gcgccccCcc gagctccgcc ctagccccgc 31020 

ccccgcccag tgccccgccc cctgcctgcC gctagccctg cccccgcccc ggcccctgcc 31080 

cgctccgagc tccgccctgg ccccgccccg gcccctgccc gctccgagct ccgccctggc 31140 

cccgcccccc gcccagtgcc ccgccccctg cctgctgcta gccctgcccc cgccccggcc 31200 

cctgcccgct ccgagctccg ccccggcccc gccccggccc ctgcccgctc cgagctccgc 31260 

cccggccccg cccccgccca gtgccccgcc ccctgactgc tgctagccct gcccccgccc 31320 

cggcccctgc ccgctccgag ctccgccccg gccccgcccc ggcccctgcc cgctccgagc 31380 

tccgccccgg ccccgccccg gcccctgccc gctccgagct tcgccccggc cccgccccgg 31440 

cccctgcccg ctccgagctc cgccccggcc ccgcccccgc accttctcgc gcagccgctc 31500 

gcgcagtgcg gccaggtgtg cctcgcggat ctccttgctg agctccatct tgtagttgag 31560 

cttctcctcc gcctggcggc tgaagttgtt attctcctcc agcgccttgt gcagcacctc 31620 

gcgctcgtgc tcgcgccgct ccgccagctg cttcagcacc tgcgcc'tcct gcgtctgtgc 31680 

ggggccggcg ggcgcgcgtg agcggcaacc ccgggccctg cccggccgga ctcctccctg 31740 

ctctccgcct cccgcccagc gcccgctcgc ctcacctggc gcctccacct gcccaggcct 31800 

cggtgggcgc cgggaccccc gggcgctgcc ctgggaaccc tcgcctgcca tccggcctgt 31860 

ggtcggggca gggccagggg gtcgcgatcc gccgcccccg cccccgtccc tgcctcgcgc 31920 

gcgggtcccg cggtcctggc tgcgcccagg gcccccgcca taccctgccg ccactgcaca 31980 

ccctgccctg cgcgtctgcc cctccaagga ccagcagcaa gaaaccctaa acttgtgggc 32040 

ggtctctgag ctttgtctct tcctcggaca tccgcccact gagcagagta gctgcttgtt 32100 

acacaccggg ttcccagctc ccaattaggt gcccaggagc ggagggtccc cagggatgct 32160 

gggggagggg ccggctggtg acccctggga ggagagcggg gcagcaggac ccgcacccac 32220 

atgccagtcc ctactagtca gccctgtgaa ccctggtctc tggcctcacc gggaagggaa 32280 

cggagccgct tcccctgccc aatgcgttgg cctccagggt ggcaccccca aaaggacatt 32340 

tttatctctg tttcagtctc agaggggctg gtgggagggg aggctgcagg gaggggacct 32400 

ggagcccaca cccacctctc ccagggcccc tccgccctcc agcaagcctc agggtcttca 32460 

cacatgaggc ccttcctcca gcttccctgt ctgggagagg gatgccccac ccgacgtccc 32520 

cagggcccat ctggggacca ccccctagca tcctgctggc cctgacaagg gtgcctccca 32580 

ccctcaccag aggctcctgc tccttccagg tggccgcctc ggaacccttc ctcctctcca 32640 

tccctttctt tttttgttct tgtttgtttt ttgaaatgga gtctcaccct . gtcgcccggg 32700 

ctgaggagtg cagtggcgca gtctcggctc actgcatcct ccacttcttg ggttcaagca 32760 

attcccctgc ctcagactcc ctagtaggtg ggattacagg tgtgcaccac cacacctggc 32320 

taattttgta tttttagtac agatggggtt tcaccatgtt ggccaggctg atcttgaact 32880 

tccaacctca agtgatctgc ctgcctcagc ttcccaaagt tctgggatta caggcgtgag 32940 

ccaccacacc cggcctctcc ccatcccatt cttatctctc agaaagaggc ccagggagcc 33000 

acagcccctc ctgctccagg ccaaggcact gaccaagcct gtccgggagc accctgcttc 33060 

ttgcaggccc tgtccccgtg ggccgcctcc gttgaaactc ctggggggtg ggggatggag 33120 

gactccttgc cttcctccgc tcctcggctg cctccagccg cttttgcagc tcctccaggg 33180 

aggtgtcctt cttcttgggt ggggaggaga gcatagggct ctctggggac aggtcagaag 33240 

gggacttgag gatgacctcg aagctctggc ctgaggcccg cttgtccagc tgcttcacct 33300 

ccatgtctgc agggcaagac cagagtagag cttcagaggc ccggccaggg catggcgtgg 33360 

gctgagcggg atgctcccag cacacatcca accccagggc tgggcgagag ggggtggctg 33420 

ctcccgcagg aatcccaggc ttcagccccc aggatgggcc ccttccccct agaacctccc 33480 

tctccagagg cagccaggac gggagttcag agagactgcc ggaggccggg ggaaaaggtg 33540 

aggtgggcag gcaccgcagg gaagggcagg cggcagccag gcactcaccc ccgtactggt 33600 

agacggtatt ggggtgcggc tgtgtgtaga agcaggagca gatgagcgac agcaccgaca 33660 

gctccttcat cttctccttg taggctgtgg gcacaaggct gggctgagca agcaccactg 33720 
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gggcctgccc acctgggccc ccgttttccc tccccatggc tgcctctatc atgtctccgt 33780 

gagacacgga gctgcccagc acgctccctt gtgtgtctcc acaccgccgg ccccttcgtc 33840 

tctccagctc tctcgcttcc agacgtcggc actgtctccg tggtgtgtcc cctgccctct 33900 

gtctctctcg ccctctgcct ctccccgctt ttcctctctc tcggcattaa tgtctgtctc 33960 

atcttccaca ctgacttgtt tctccatcct tctcctgcct gctgtggtct gaatgtttcc 34020 

attacccaaa actcatgtgt tgaaatcgta accccaaggt gccggtgtgc ggaggtgagg 34080 

cattcggagg gaattaggcc atgaggatag agccctccta agtggcccca gagtggggct 34140 

tcagagaact ccctcacctt ccatcatgtg aggacacagc cagaagacgc cacccgtcta 34200 

tgtaccagga ggcgagacct ctccaggcac cgactctgcc ggcaccCCga tcctggactt 34260 

tctggcctcc agagcgatgg gaaataagtt cctgtcgtct ataaaccact cagtctcagg 34320 

tacctgccca gactgacaaa gtggctaccc ctgcctgtct gggtctctgt ttaccttctg 34380 

tgtgtctgac tcCgtcactg tcattgtatc tttctgtgtc tctgggggta gcccctgact 34440 

ctgtctttct ccctgagtgc atctttctgt gattccttgt cactgtgtgt ctttccgact 34500 

cCtacctccc tcCgtcccgc tacttctctc tcccctcctc ctccttccca ctcctcgcca 34560 

gctcaagcag gcaagatcta ctcatgacgg gaccagcaca gatgcaaacc ctctgtgggc 34620 

aggactttct tgggcCgCaa acctggatga agccctcaga ccctcctttt tccttcccaa 34680 

Cgattgtgtg gtcacctcga gatgaaacca ggccctctcc aggcacatgc tctctgtcta 34740 

tGtagggctg ggcttgggcc actgatgcca ccaaggagca agggagggaa gctgtccgtt 34800 

cagcaccaca gccagccctc ttgcccattc aggtcaatca agtgcccacc agccagtgtc 34860 

cctgctgccc aacccaaacc agaagcaagc cgggctcctg tggccctgtg ccctgtcagg 34920 

ggaagaggaa ggcgcctgct gtcacagtga aaataattta gctcttttgg tctattcagg 34980 

gcgaacctca ttcctaagca gacacgctgg cccggtttct cactagtgct cgataatcct 35040 

tttggctggg tgcagtggct catttaactg taatcccagc actttgggag gccaaggcag 35100 

gtggaacacc tgaggtcagg agtttgagac cagcctgacc aacatggtga aacccgatct 35160 

ctactaaaaa tataaaaatt agccaggcgt ggCggcaggc acctgtaatc ctagctactt 35220 

gggaggctga ggcaggagaa tcgcttgaac ctgggaggcg gaggttgcag tgagccgagg 35280 

tcgcgccatc gcactccagc ctgggtgaca gtgtgagact ccgtctcaaa acagaaagaa 35340 

aaagagagag aggaagaaag gaaggaggga gggagggagg aaaagaagaa aggaaaggaa 35400 

aggaagacag acaaggcaga agtaatcaag cctttcatgg tgagctgggt cttctggtga 35460 

cagtgcagag aatggtctgt cctgacttaa atctcctggt gacctacact tttctggaca 35520 

gagcagcaca gagcccaaga gggCgtaagg aggagcagaa aggaatccca gggtgggcag 3 5580 

gcccgtgcga gagcctttgg gggaaggaat gagactttga gccgggaagc gaggcaaagc 3 5640 

tacctgtctt ggtcattgtc ttcagggagg gagatggagg gggaccaggt gggggagcct 35700 

cacaggggac tttggtctga cttgtcaagt tttctttctt tctttttgag atggagCctt 35760 

gcactgttgc ccaggctgca gtgcagtggt gcgatctcgg ctcaccgcaa gctccgcctc 35820 

ctgggtt'cac accattct'cc tgcctcagcc tcccgagtag ctgggaccac aggcaccgcc 3 5880 

accacaccca gctaatcttt tgtattttta gtagagacgg ggtttcacta tattagccag 35940 

gatagtctcg atctcctgac ctcgtgatcc gcccgcctcg acctcccaaa gtgctgggat 36000 

tacaggtgtg agccactgtg cctggcctac tttaCttCtt agaaacagga ctgtgctctg 36060 

ttgcccatgc tggagtgtag ggtgcagctg tgcggttcac tgcagccttg aacttctggg 36120 

cttgacggat cctgccatct tagcagctgg gactacaggt gcatgccagc acaccagttt 36180 

tctttttttt tttatctctg ctcactgcaa tcccgcctcc tgggttctag cgattctcct 36240 

gcctcagcct cccaagtagc agggatcaca cgcacatgcc accacacccg gctaattttt 3 6300 

gtatctctag tagagacagg gtttcactaC gttggtcagg ctggtcttga gccaccgcgc 36360 

ccgcccggcc tacacaccag cttaaaaaaa agaaaaaaat agctgggcgt ggtggctcat 36420 

gcctgtaatc ccagcacttc gggaggctga ggcaggcaga tcacctgagg tcaggagttc 36480 

aagaccaacc tggccaacat ggcgaaaccc tgtctctact acaaatataa aaatcagcca 3 6540 

ggcgtggtgg cgggctcctc taattccagc tacttgggag gctgaggcag gagaatcact 36600 

tgaacccggg aggtggaggt tgaagtgagc caagatcgag ctactgcact ccagcctggg 36660 

agcaagactc ccgtctcaaa aaaaaaaaaa aaatttgtag tggtatggag gccgggcatg 3 6720 

gtggctcacg cctgtaatcc cagaactttg aggggccaag gcgggcagat catgaggtca 36780 

ggagttcgag accagcctga ccaacatgat gaaaccctgt ctctactaaa aataacaaaa 36840 

attagccagg catggtggcg ggcacgtgta gtcccagcta ctcgggagac tgagacggga 36900 

gaatcgcttg aacccaggag gcagaggttg cagtgagctg agatcacgcc actgcactcc 36960 

agcctgggtg acagagtgag actctgtctc aaaaacaaac acaaacaaac atatatatat 37020 

atacatgtat atatataata tatatatacg tatatataca cgtgtatata tataatatat 37080 

atacgtatat atacacgtgt atatataata tatatacgta tatatgtata tattaatata 37140 

tatacgtata tatacacgtg tatatattaa tatatatacg tatatataca cgtgtgtata 3720.0 

tattaatata tatacgtata tatgtgtgtg tgtgtatata tatatgtata tatatatata 37260 

tatatacata tatatataca gagagagaga gagtagtgat aggtcttgct gtcttgtcca 37320 

ggctgatctt gaactcccgg cctcaagaga ccctcccacc tcagcctccc aaagcactag 37380 
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gattaCaggt gtaagccaca gtacctagcc tattaaaaat taatgttaaa caagaggatg 37440 

tgatgaggga gttagagggt gtgccagcca tgtgttccac agcagcaggt caggagacat 37500 

tggggacatt tagaggagct gaagaggtgg ccaaccctgt gctcaggagg acgggggagg 37560 

gagagagcaa gagggagttt gggctggggc agaacgtacc tgggtcctga gaggataaga 37620 

aggtagggac ttggcccctc caggcctgac tctgccagca accagctccc tatcagcaga 37680 

ctccaggccc ctacccttca gctcatcctt ccttatcaca catccaaaac tccgaatgtg 37740 

gccgggcgca gtggctcacg cctgtaaccc cagaactttg ggaggctgag gcaggaggat 37800 

cgcttgagaa caagagtttg agaccagcct aggcaacatg gtgaaacccc atctctacta 37860 

aaaataCaaa aattagctgg gtgtggtggc acatgcctgt tgccccagct actcaggagg 37920 

ctgaggcagg agaatcactt gagcctggaa ggcggaagtt gtagtgagca gagattgtgc 37980 

cactgcgttc cagcctgggc aacacagcga gactctgtct caaaaaacaa aaactggaat 38040 

gtgtttacca taaaggccag aaaatgtgat taacagctgc tcaaagcccc tgtctgccct 38100 

aagcctgaaa tttccaccga aaaaaagatc tgtaggctca tacagaggaa ggacaaacac 38160 

cagggaggct ctcttccagt ttgcttcacc tcagcaagca gacggctggc agcaatttgg 38220 

gggcaggtgt gagcacctgc atcaCcagga aagaaggggc acggtgggga cgcaggtcag 38280 

acctctcaca ggtcttggct ctgcccagga gacacgtgtc caactgagag gtgaggaact 38340 

gggttctgca gctgcagaca caggtgcggc tcagcatctg atggccacgg agaccccctg 3840 0 

gcttggcttc tcccagctgg tggcccatga ggagcttcta tcccaagaga ctgtccctca 38460 

aggagcaagt gggaccaggt acccacagga cggagcctgg gagtgaggcc tgccctgtgg 38520 

tctggctaca gggaggaagg gcagattgga gggggcagga cagcaggtca ggaattggcc 38580 

aaetctggag agagcaagca aggggaagtc tgcgcacagg gcagggctgg tcaggggcga 38640 

ggcagggcat tggaccagta ttttcagagc tggtgaggct taaagagcat gtctactgcc 38700 

tcttattaca gagagaggaC gccgaggccc agacccatcc aggccacctc tccacagaca 38760 

cagctggtgc cagggaagcc cctcccagag cctcaaggca ttgccccctc tctctctccc 38820 

tttttgcttt tttggagacg gagtctcact ctgtctccca ggctggagcg cagtggtaca 38880 

atctcggctc acggcaagcc ccgcctcccg gattcacgcc attctcctgc ctcagcctcc 38940 

cgaatagctg ggactacagg cgcccgccac cacgcccagc taattttttg tatttttagt 39000 

agagacgggg tttcactgtg ttagccagga tggtctcgat ctcctgacct tgtgatccgc 39060 

ccgtctcagc ctcccaaagt gctgggatta caggtgtgag ccaccgcgcc tggacttttt 39120 

ttttttttca agacggggtc tcactctgtc acccaggctg gagtgcagtg gcgcgatgtc 39180 

ggctcactgc aacctctgcc tccccagttc aagtgattct cctgcctcag cctcccaagt 39240 

agccagaatt acaggcacat gccaccatgc ccagctaatt ttctgtattt Ctagtagaga 39300 

tgaggtttca ccatgttggc caggctggtc ttgaactccC gacctccggt gatctgccca 39360 

cctcagcctc ccaaagtgct gggatgacag gcgtgagccc ccgcgectgg ccccccgcag 39420 

tgctgggatt acaggcgtga gcccccgcgc ccggcccctc cctctctttg actcccttct 39480 

ttctcaccgc cccctcccca ccatccttcc ccttcactga cttcagggag ttaaaaacaa 395.40 

ttctcgcagt gagctgggcC tgtctcgtct ccctgcttct ctttgtacca aacattagat 39600 

accgaggaaa tgcggattgg cctttggatg attcatgagc aggagtcaga aaaaggcacc 39660 

aggttggcct caagcagcag ggtatagtag tgcccgctcc cagggtcaca cctcacgccc 39720 

acccctcccg ccgtccaggt ggatggtgcc cactcccagg gtcacacctc acgcccaccc 39780 

ctcccgccgt ccaggtggat ggtgcccact cccagggtca cacctcacgc ccacccctcc 39840 

cgtcgcccag gtggatggtg cccactccca gggtcacacc. tcacgcccgc ccctcccacc 39900 

cacccgggtg gatggtgccc gctcccaggg tcacacctga cgcccacccg ggtggatggt 39960 

gcccgctccc agggtcacac ctcacgccca cccctcccgc ccgcccgggt ggatggtgcc 40020 

cgctcccagg gtcacacctc acgcccaccc ctcccgccgt ccaggtggat ggtgcccact 40080 

cccagggtca cacctcacgc ccacccctcc cgccgcccag gtggatggtg cccactccca 40140 

gggtcacacc tcacacccac ccctcccgcc cacccgggtg gatgccctta tcagctctcc 40200 

ttctccttct ctttcgtctt cttcgtcttc ctcctcttct ttcttctttt tttttttttt 40260 

tagaaagagt ttctactctt gctgcccagg ctggagtgca atggcacaat ctcagctcac 40320 

tgcaacctcc ctctccccgg gtcaagcaat tatcctgcct cagtctccca gattgctggg 40380 

atcacaggag tgtgtcacca cacctggcta attttgtact tttagcagag aggggggatt 40440 

tcaccatgtt ggccaggcta gtctcgaact cttgacctca gtttatccac cggcctcagc 40500 

ctctcaaagt gctgggatta caggcatgag ccaccctatc tgcctcactt ctacagagga 40560 

ggaatgaagg ctcagagagg gcaagcattc cacccagcat cacacagagt gccgggtgag 40620 

agcccagtca tgagcctggg cctgactgca ggctcctgtt .gggagctcgc ggaggtgggg 40680 

gatctgtcca gaactgagag gccaggggac cacagtggcc tctgacccct ggagggccct 40740 

ggaggctgct gccggctccc cccgggggca gatggaggtc actgtcaccc aggctgcttc 40800 

tcatggtgcc aggagcacag catggcagga gccaccagcc gatttgcctt tccctgggca 40860 

ggaaactcag aaatgtggct accacagtca ggctgcttga cgtgcggtga gcactcatct 40920 

cttagcaggc aagcggccaa gcacctttcc tgaaatattg aggcctcaga acaagcccca 40980 

ggagaggtgc cagcaccgtc atctctaccc agataaggag acccaggtcc tgagaggtta 41040 
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ggcagctcgg acaacaccac acagctggag gaggtcagac tcCgggttgc agaaggagaa 41100 

tgtgagcaga ggccacaaaa gagcgaggag ccagtgccca gatgccgaga tgccctcgcc 41160 

cCcccagctc agccccagga accgagccca tggggaggga ccgtcaggga aaggcCgcca 41220 

ggaagggcag gaggcggccc tggagaggac ggcgctgccc tcaggggcag gaggggagtc 41280 

ccctccgctg agagcccccc cacccccagt atccccgggg gtgtccagga ggaggcggag 41340 

ggaggaagcg cagatggaca ggactcccag atagggtggg gaggtgtggc cggtgacaca 41400 

cacggtcccc tcctggcagg tgctgaagtc acctggagcc tccaagcccg tggggcctga 41460 

ggggcggggt caggtcgggc acgcgCgggt gggcggagtt ctgcgccccg ggccaaggcg .41520 

cccgagttga accagtcagc tcgggagagg gaccgcggcg acctgtcccg ggggcgtaag 41580 

aaaaggtggg agggagtgcg gctcgtgaac gggggcggcg atgggaagga ggtgcggccc 41640 

ttcgtcctgt cctcccaaac gtcgagtgaa aaacgaagcg ggttctgcgg cctcgcggcg 41700 

gagcagagcg tttcgggaag ggcgggccca gcgtcctcgc gcccgaggtc gcccggcagc 41760 

tcccctgcgt ccagaatccg ccccccgccc gggcccgcgc ccgcccctcc gcctgagctc 4182 0 

cgcgcgggac gggccgggag gccggggtgg gcgctacctt cgaaggcggt gggtccgccc 41880 

cgcgggaggt ggaggggcgg gaggggcgga gccctctggt ctccggaggg tttggggatc 41940 

gcagtcgccc ctcccccatc cagaccccgc ggcgcaaagg gcagtggctt ttctggccag 42000 

agcaggtggc gcgggcgtcg caaagggtgg tccccgaggc cgcagcggtg tggggggagg 42060 

gcgcggtccc cctcactccg ggctccgccg tgtctggccc -gcccccctcc ttcagcgccc 42120 

cctccagccc ctgtgctgca ctggcgcggg gagcgccggg ttcccggctg gggctttggc 42180 

agagggtccc accctctccc cgcctcccca cgaaggctct ggcggaccca gatctcgggt 42240 

cgccggacgc cccagggacc ccgcccgcac atcgcgagcg cgcccacccg gtcgcgagcc 42300 

cacgcccggg tctgggagcc accctgcggc agtcgcgccc tgcgtggcac gcCgctcccc 42360 

caggggcgag gcgcccccgc ccgacgtccc ggtcccgagc gctccccggc gcggcgcctc 42420 

gcagcccagc gccccaccag ccccgccggc gccgcagacc ccagcctcgg gcgggteggg 42480 

cccaggcttg caacgcgcag ggtaggagaa gggaaattgg cgtccgctgc cggccgctgc 42540 

cccaggcgag gccagacgag gcctctgctc agatcccgcc gccccacaaa gcccgtggcc 42600 

ccggagccta ccggaaatgg tgctggccat ggtgctggcg gcggttgggc ctgcggaggc 42660 

tggagaggcg caagtggcgg ccggagctgc agacggctgg tgctgcagtg ccggggaggg 42720 

gaggggagag gagtggaggg agcgagggcg ggcgggaggc gggcgcggcg ggagagagag 42780 

agggagggag acagagggag agagagagag ggttggggga aggagcgggg ggaggaggga 42840 

gggagggtcg ggggaaggag agagagagag agagagactg cgggggcggg ggaaggaggg 42900 

agggaggaag ggagggagga agagagagag gagcaagcgc ctggctgcgg aaggggccgc 42960 

ggctctcagg gggagagggc ggaggagggg ggctacccga actgcaacaa gaccccccac 43020 

cctccaaccg ctcacagcgg gacagctgct tctccaactt ggctttgtga ggcctgagag 43080 

tggggtgggg gtggagatga gcccccattc cccagggcag gcggggcagg ggcaatgccg 43140 

gaggagcagg tcccacccat ggggtggggc cgcagagctc ttcgccgcca aggccgctgt 43200 

aggctgggct ggcgccaaca gggtccaggt ctgtgcctgc catcggagag gacgccacag 43260 

ccacaggggt gggcgctggc ctggaggcct ccaaggggca tctcctgtga gcccagggga 43320 

tgggcaggat ctgagcggag aagagcgaaa gtggaggagt gaggccagaa caaaggcttt 43380 

gccgtgaaag aggtggtttc ccgcctgggc tcagaccttc actcactgtg tggcccaggc 43440 
caagggcaag cgtctgacct cgctgggcct ttgtttctca ggggtaagat gaaacaatga . 43500 

tgcccccaga cgatggagag gaggggtgcc agggttgtgc gcacttagtg agtggggggc 43560 

aacctatcct gcctccccct ctcctcataa ctcccaaagg gaaagcctgg taggcaaacg 43620 

gagcgtcttt gccattgcag ggatgaagcc accgaggcag ggagaaaagt gctttgccct 43680 

acaagcaact aagtcatagg gccaggagca aaaccctgaa aacctcagga gacttgcaga 43740 

gccatgaggc tggctcagca acacaaaagc caggggcaag cctcagctct agcagtgcgg 43 800 

tgggagcacc caaggccagt cacatcctag ggtggcctgg agagtcctga cccctgacgt 43860 

gcaagccggc atcatccccg ggactgtgag tctggtgggg gtgatgccca ggaatgtgac 43920 

attgtgtggc ccagaggtac ccttaagact ggaggatcac caggcgggcc ctgacctcat 43980 

cacaggagcc ctttaaaagc agtttccttt gcctggttga agaaatcgga gggatcaaac 44040 

caaagaaggt tttccgttgt tgagatgagg gggccacgtg gcaaggatct gagaactgct 44100 

cccagccaac agccagcaag acaacaagac cttaactgca aggaagtgag ttctgccaac 44160 

aagaagagaa tgggcttgga ggcaggtttg accccagggc ctccacacaa gaactgagcc 44220 

caactgccca cttggtttca gccttgggtt actaagaatt aggaggtaat gaatgagagt 44280 

tgttttaagc tgttggtttt gtggtgattt gctatgaagc catatcaaac taatatacac 44340 

acagaggtgt tggcccctgg gccattccta ggaagccagc tctgcgaagg aggaagaagg 44400 

gcagagaggc acacagagct gcccaccaca gcagctgtgt cctccctgtt ggccaccaca 44460 

gtagcagttg gggatggtca gcatccttca ggcagacccc ■ agccccgggt gctggagctc 44520 

aggtgctagg gatcaagaga agtagccctc tctgggacct ccagagtctt ctcatgtggg 4.4580 

tggggtagga cccacccagt caggctcaga gcaccgcaat gcctcacact cattgtgact 44640 

ctggccaggc cctctctgag cctctgtgtc ctcatctgga gcacagggac caggtgtgtg 4470Q 
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gaagcccgtg gcatagtgcc aggaacacag tagatgtgca cagtgtgcac .tagcaggaac 44760 

acacaacagg ggtactgact gtcagcaccc aggcaggcac acgcaatggg gCactgactg 44320 

Ccagccatac tgactgtcag cgtgctagca ggcatacaca acagcCgtac tgacagcaca 44880 

ctagcaggca catgccatag gtgtactgac tctcagtgca ctggcaggca cacgcaatag 44940 

gagtaatgac agcatgctgg caggcacaca atagctgtac tgactgtttg ccccaatata 45000 

gtgccaggtc ttggagcaga ttttgacttc tcaccaagat caaatgcaga aagtgcacga 45060 

gcatttcaaa gatgtttttc acatgcacat tagtgctagt taaaaaaatg ttttgactgg 45120 

gtgcagtggc ccacaactgt aatcccaaca ctttgggggg ccgaggtggg cagatcacct 45180 

gaggtcagga gtttgagacc agcctggcca acatggtgaa accccatcta ccctaaaaat 45240 

acaaaaatta gccaggtgtg gtggcaggtg cctgtaatct cagctacttt ggaggctgaa 45300 

gcaggagaat cacttgaatc caggaggcag aggttgcagt gagccgagat cccaccactg 453 60 

cactccagcc tgggcaacaa tatcaagact ccacctcaaa aaaaaaaatg tttttcataa 45420 

agtgtgactt ttatcagacc tctgcattct tgaaattaac tctggcttgg ctgggcgtgg 45480 

tggcccacac ctgtaatctt aacactttgg gaggctgagg tgggcagatc acgaggtcag 45540 

gagttcaaga ccagcctgac caacatgatg aaaccccatc tctactaaaa atacaaaaat 45600 

tagccgggcg tggtggcatg cacctgtaat cccagctact caggaggctg aggcaggaga 45660 

atcgcttgaa cccaggaggt ggaggttgca gggagccgag atcgcaccac tcCattccag 45720 

cctgggcgac agagcaagac tctgtctcaa aaaaaaaaaa gaaagaaaga aattaactct 45780 

ggctcctaga aggagcccta tatctcagca ggacactcag tcattcaaca gacatctgtc 45840 

aagcacctgc tgtatgctgg agctgtgggt acgtcagcaa ttagaggaag agggcagggg 45900 

tacaggagtt cctgaccacc ccaggccagc acgctcctat agcagctggc aaggagcaga 45960 

tgactcagac ttcagctcag tccacaggac agccttttct ggccactgct ctcaggagat 46020 

gagatgtgtg gctgcaaaag gtaaactcct ggctcctgag caggctctgg gcaacctgct 46080 

caacgctctg tgcctcactt tctcacccag aaagtgtgga caatgagagg acttatctgg 46140 

ctgggcgcgg tggctcacgc ctgtaatccc agcactttgg gaggccgagg cgggtggatc 46200 

acctgaggtc aggagttcaa gacctgcctg gccaacacgg tcaaactcca tctctactaa 46260 

aaatataaaa aactagccgg gcttagtggt gcacacctgt aatcccagct acttgagagg 46320 

ctgaggcagg agaatcactc gaacccagga ggtggaggtt gcagtgagcc aagattgtgc 46380 

cactgcactc cagcctgggc aaaaagccaa aactctgtct caaagaaaaa agaatcatgg 46440 

cagaaggtga agtctatgtt agtcccagtt cccaggtcgt acatggcggc aggagaaaga 46500 

gagagagaag gggaaactgc cacttttaaa ccatcgggtc tcctgagcac tcactgtcag 46560 

aacagcctgg aggaaactga ccgcatgatc caaccacctc cctccaggtc cctccctcca 46620 

cacgtgggga ttacaattcg aggtgagact tgggtggaga cacagagccg aaccatacca 46680 

gcatgtatgg ggggcactga aacttgtgct tggtgcccat tcattcaacg agtgtgtgtg 46740 

gctggtctcc tcatcttcaa ctccctgccg agtctcagat aggcagcctg cagttccttc 46800 

accacaacag gcacatgggg ctgggtgcca gtgagtgctg gggcttctcc gagcactatc 46860 

tcacacccag gagcgtgggc acgcatggca ttcgcatgtg ccgtcagCgg acattaaaca 46920 

cagccatgaa gaagccacga agaagtgctg cctgccggcc gtgcgcggtc acgcagcgcc 46980 

aactcccccc tggggccttc tggggccttc tggggcatgg gagctggggc cgcctgagac 47040 

aaacatccgt gacgctgggc tgaccccaca gaacggtgcg ggcctcgctc ttggagtcag 47100 

ccctgctgcc agccagtgcc gggtgctggg gactcaggga ggcccgccgg gaccactgcg 47160 

ggacagtgag ccgagcagaa gctggaacgc aggagaggaa ggagaggggg cggtcagggc 47220 

tctcaggagc cgggtcctgg gcaaggcgca gccgttttca aattttcagg aaagcggtcg 47280 

gctcacactc gagcagtaaa aagatgcctc tggggaggag gcccgtgcag ctctccgggc 47340 

aatggtggtg gctcggccta gagaggcggt agtggaacgc agaccctggt gggggaatga 47400 

catcaaggga ggagacggge gggaccccag atttctgccc gtgggcgatg gaagtgaggt 47460 

tcactggcca gcggagccgg acacagaacg cgcaaaacgc cgtgtaggcc tggaggagcc 47520 

gaagagcagg cggaccccct ccgcggggga acagtttccg ccgggagcac aaagcaacgg 47580 

accggaagtg gggggcggaa gtgcagtggg ctcagcgccg actgcgcgcc tctgcccgcg 47640 

aaaactctga gctggctgac agctggggac gggtggcggc cctcgactgg agtcggttga 47700 

gttcctgagg gaccccggtt ctggaaggct cgccgcggag acaagtgagc agtgagtcgc 47760 

agtgacccta caagtggttc ttttacccga gcggctcgta ggcgcgttgc ggtttttcga 47 820 

aactacagcc cccggcaggc cccaagccgc cctcggggcc gcgggtcggc ggattggccg 47880 

cgctgcattt tgggacctgt agtttcctgc gctcgtggcg ctggcgccgc ggcgtcggct 47 940 

gagcccttga ccggggctgg agggaagggc cgacactcag tgtgtccgcg tctgttctgt 48000 

tagtcccagt tcccgggcgg gattgaggct tagagaagtt gagtgatttg ctgagggctg 48060 

cacgggttgg catcccggca tgctctttcg ctactttggc tgcatctggt tgcccacccg 48120 

ggcggatggg gaatggactc cagccagcca ggagggcaga gggctggaga ggcagggccg 48180 

gaggttcaga ccctccgctc cgacgttgcg cctggtgagg ccgggagggg tgccgcttgc 48240 

ctcttzcagcc ctcacgctct tgtggaagtc gcggaattac Cgcaggcgga acttgcagca 48300 

ctgtgggcgt cttttccaga gaaggacgga gttgtggggc gggaggataa ggcaaggccc 48360 
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agccacttcg catcttcgcc ccgccagctc ctcgagatgg gatataccag ggttgctctc 48420 

caaccctctc cgcaggaggg actgatggaa acgcctggga aagtagcccg gtacccacaa 48480 

aggctgtcta caaacagagt cctactgtct ttcccaggtc cgtgccatag ggattctcga 48540 

agagaacagc gttgtgtccc agtgcacatg ctcgcatcgc ttaccaggag tgcccgagac 48600 

cctaagatgt tcggagtggt tttttcgcac agacccgaat agcctgcccc tcagccacgc 48660 

tctgtgccct tctgagaaca ggctgatatg cccaagatag tcctgaatgg tgtgaccgta 48720 

gacttccctt tccagcccta caaatgccaa caggagtaca tgaccaaggt cctggaatgt 48780 

ctgcagcagg tagagcacag gccccgagga aaggactgcg ggtgggtgga gcttcagcca 48840 

ggacggggtg tgcttccctc tcccggccca ttccagccag gcccctccgg gccagaggca 48900 

gcgtctgtca taaaaagggc tggtgttcca ggtggggtca gagagaggat tgacaagtaa 48960 

aaacgatcgt cctttgaagg gggccggccc ctccacacct gtgggtattc ctcatcaggc 49020 

gggacgagag accgagaaaa tgaataagac acagagacaa agtacagaga gaaaagtggg 49080 

cccaggggac cggcgctcag catacagagg acctgcaccg gcaccagtct ctgagtttcc 49140 

Ccagtattca ttaattacta ttttcactat ctcagcaaga ggaatgcggc aggacagcaa 49200 

ggtgatagtg gggagaaggt cagcaagaaa acgtgagcaa aggaatctgg gtcacaaata 49260 

agttcaaggg aaggtactat gcctggatgt gcacgcaggc Cagttttatg cttttctcca 49320 

cccaaacatc tcggtggagc aaagagtaac agagcagcat tgctgccaat atgtctcgcc 49380 

tcctgccaca gggcggcttt tctcctatct cagaattgaa caaatgtaca atcgggtttt 49440 

ataccgaaac attcagttcc caggggcagg caggagacag tggccttcct ctatctcgac 49500 

tgcaagaggc tttcctcttt tactaatcct cagcacagac ccttcacggg tgttgggctg 49560 

ggggaccgtc aggtctttcc catcccacga ggccacattt cagactatca catggagaga 49620 

aaccttgggc aatacccggc tttccagggc agaggtccct gcggctttcc gcagtgcatc 49680 

gtgcccctgg tttatcgaga ctggagaatg gcgatgactt ttaccaagca tactgcctgt 49740 

aaacatattg ttaacaaggc atgttctgca cagctctaga tcccctaaac cttgattcca 49800 

tacaacacaC gtttctgtga gctcaaggct ggggcaaagt tacagattaa cagcatctta 49860 

gggcaaagca attgttcagg gtacaggtca aaatggagtg tgttatgtct tccctttcta 49920 

catagacaca gtaacagtct gatctctctt ttccctacag tccttgaggg tgacagactt 49980 

aggagtgcct tgggggcctc tctgaggagc agctgatatt cacgggtcag gaggaagcat 50040 

ttccattaga ggggcagccg gtggccagcc tcacttggaa ggtctttgaa cctcgggggt 50100 

gcagggaggt ggcagtggtg caggttgcct tctcccgggt tccttgaggt gccctcttgt 50160 

acccggctca cacccctccc ctccccgagt ttcctgctca ggtccccgtc tgagagcttg 50220 

tatgtaggac gtcagatagg acagcataaa tgtttggatc cagaaacgca gaacagtttc 50280 

ctattttgag acttgacacc taattagtca tcttactatt taagctgaaa aatagtgtcg 50340 

tgttttgggt aacgttctgc aaatcgtttg ctaatggcgg ctgagttgct tcacgccctt 50400 

tagggcaaga gtgggacttg cctgtggact tctccgcggt cccacagggc tctcgccacc 50460 

tggcagtggc ctctgcatct gcaaagagct gcccgctggc cgccgaagct tgtctcaggg 50520 

cagcttgtgt ggcctcgcct cttcctggct tccccgtaac cctcgccccg aactccgttc 50580 

agaaggtgaa tggcatcctg gagagcccta cgggtacagg. gaagacgctg tgcctgctgt 50640 

gcaccacgct ggcctggcga gaacacctcc gagacggcat ctctgcccgc aagattgccg 50700 

agagggcgca aggagagctt ttcccggatc gggccttgtc atcctggggc aacgctgctg 50760 

ctgctgctgg agaccccata ggtgacccta gttcccaggc ctctcctggc ctcctgtggg 50820 

gatggttggc aagggatggc gctgagggtg gggtgggccc atggggactc ctgccgtctc 50880 

tcaagcagaa ctcaaggaga attctttagc tgctgtataa tttctcgcca tcgtgggtgt 50940 

aaacctaggg ttgggctttt ttgctgaatt agggcacggc agatgcccac ttcacccatt 51000 

tttgataaac cagtatctgg ggtgtcagat tcttggctgt ctgcagggcc gagttagccg 51060 

aatgccacct gcctttgata cgtgagaacg ttgtctgaga accgtgactt ctgtgcttgc 51120 

ttgtgtctgg Ccagcttgct acacggacat cccaaagatt atttacgcct ccaggaccca 51180 

ctcgcaactc acacaggtca tcaacgagct tcggaacacc tcctaccggt gggtcagacg 51240 

agtttacacc Cgtctcgggg tcctcaagag aaccagcttg gcatggtgct gagtccacag 51300 

ccccatgctg tgctgtggtg gagggtggtg gtctttctag acgctccccc gaagtgtgca 51360 

gagcgctggt gcccaggggt ggggtgcggc ctgggctgcc tccaatgccc attacttgtg 51420 

aggaagcagc tttgcatctg tgtgctgacc ttgggcgggc gtcctgagct cctcgcaggt 51480 

gctgttgtag cagctgtgca gtaggtcagg gctggccccc agtgcagctt tgcacatgaa 51540 

gtaggaggag gccctgctgc ttgtcagagc ccagcagagt cttggtgttc tgtcgggttc 51600 

ctgtggccgg accagtggca gggtgctgtg gaagctgtcg aatctcctcc ctctgtccag 51660 

tacccccgct cgtcttctag ctccctccta cgcccgggcc acgtttcagt tatgctcact 51720 

tcctctgacc gccgaggctc ctgcgtgtct ccatacagct cacgctgcag ggccacgctg 51780 

tgggtgttgg agacagctcc tcctcgaccc acggtgctct ctcccaccag gcctaaggtg 51840 

tgtgtgctgg gctcccggga gcagctgtgc atccatcctg aggtgaagaa acaagagagt 51900 

aaccatctac aggtaggctc ctgggctccc gctccggctc agtgtccgac aggcgagtgc 51960 

tgctgggtgt ccagagcccc aggctgcgct cccgctgggc tagggtttga agttcactgg 52020 
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gggactgcag gggaggacct ggtgggggtg gggactggct tcggtccttt cttggccgtg 52080 

cttcagctgc gcactctgcc cttcctccca cagatccact tgtgccgcaa gaaggtggca 52140 

agtcgctcct gtcatttcta caacaacgCa gaaggtacaa gcagctgggt gggaccaggg 52200 

tcgggttgga gtgtgtgcag cctctcaggg tggagctcag tggtgtcaca gcctggttgt 522 60 

gcttgcccgg tggggcggcc agtgcggcca tgtaccCggg ccctgtcttc tgactcgggg 52320 

ccacccatgt tagacttctg tgtggaagag ctcacacagt ggtctgagac agccagccgg 52380 

caagactgcc cctggctggt gcctggggcc ttggattttg ggaaggctcc ctccatttcc 52440 

tgatgagagg gtctccctgc acctaacctg ctggtgcaaa cagtaggggt Cttgctgaac 52500 

accggctttc tcttcgggga ctttgttgct tgcccagcag caggtgctcc agtgaccggc 52560 

cctcatacca tcttgggagg gtgtcctgga agccgtgtct ggcctcccgc gaccctgccc 52 620 

cgtgtgtctt tctcctgtgc tgaccttgct gcggaaaatt atggccctga gtgtgactcc 52680 

aggctgagtc ctgtgggtcc aacacgggat gccttggggc ctcttctgga gacgggatgt 52740 

gagtgacagg agccggccgg ggcagcttgc cctgtgactg cacgtggcca cagcctgtga 52800 

gggccggggg tgcttctcca cccacgtggc tgcccctcgg gtatgtcaag ggcttctggg 52860 

gctcatcacg gggtcctaga gacagtggca gggtgcaccc ccgctggctg cccttacagt 52920 

ttctgtgacc tgagggtggc atctgtgcag ccggcgcggt ctgtgcttct gtgggatcag 52980 

ggttccctct gtttcctgcc tcagttgggg ctcaagcctc aggtgaggtg gccccggagc 53040 

actcagaagg catcggcggt cctgtgggct gctttctgca ctcacgtttg ctgagtgctc 53100 

agtgtgccag gactgaggac cctgaagctg ctcttgtatt tagggcggcg ctcccctggc 53160 

agagactgag ccaggtggtc ccgcatgacc cactaccagg cgtttctggg ccctggccct 53220 

tggagggaca gggtgggcgg aacatgggcc tgcagggagg ctcccgctta ctggaggcat 53280 

gtgQtgtgtt gctggagaca tcctctgtgt tgcttcttgt tcgctgtggt ttttggtctg 53340 

gtggcaccaa ggaccctcag Ccatcttgat gtgtggttgt ccaggccttt ttgtcggtcc 53400 

taagaagggg ctctgccttt gtgcccccag gttccctgac aggagctgcc ggctcgtccc 53460 

ggtgatgcct gcaggacgtg actctgggac ggggggttgg gcagatgtgc tgatggaaat 53 520 

tctcaagcag gcgtcatttc cgaggtcctc acccggattc ccaggacagg agtgcctgct 53580 

gggtgtcccc agtcccatgc agcgggggtc cttgggatag catggaacgc tgagcatggg 53 640 

cctggccggc cgtggtcctg gacaagggca gtgccccggt ggctgctggg cctgggacct 53700 

ggtggggacg ctgggcctgg tacctggtgg ggatgctggg cctgggacct ggtggggagg 53760 

cctctgactg cctcctggtg ctgcttccgt ctgtgttagg cctctgggta ttggggcccc 53820 

catctgcctc ctcctccagg cctgtggact cagaccagga agacacaggc cagcccctgc 53880 

ctgtccccct tggcttgggc tctcactgcc cgacctggcg ggaggttgcc tagccgcgaa 53 940 

ccttcgcacc ctgtctgcca ccggacaggc tgtgaggggg tgtctgcagc acctgcaccg 54000 

gcctgagcat cttcagagtg ggctgcagct cctggagggg tctgagagga agggaggcag 54060 

gtattttggg cgaatgagga gacagctgga gagctggcac ccttcctggc ctgcgtcctg 54120 

tgaggactct ggttggggac agcaagcttg gggtcagcct ggggcagagc ctctgggacg 54180 

gccccgcccc tcgtgcccct tcccctcgca gctcctgtcc tcgccccgcc ctcagctctc 54240 

cgccaggcaa ggtctggcaa gtgccgctgt gcggcagtgc ctgctgattg gctggtctgt 54300 

tgctatggtg ctgcccaggg gtgtgctttt cctcccctgc cttccctgct atccctggga 54360 

gtatctgggg ttgggtcatc gctggtgtgt gtgagtgtgt gtgtgtgtgt atgtgcacgt 54420 

gtgcatatgt gtgcgctcct ggcctctgca gctgagtcct ggccctcggg gggcctggca 54480 

cctcctgggg acaggcacaa agcagccatg atggagtcgg gagctggggg aggccccatt 54540 

gccccacgtg gctgccctgt gactctgggg tgcttgttag aagaggtatc tggttctgtc 54600 

tgtgtttaag caactcccta aggaattctt gtggttccag tttggggggc ctgtactgta 54660 

gaggcaaggg aggggcagga catcccccag actctgactt ctgaagcctt ttctgcccgg 54720 

ggcctctccg ccagtacagg cagtgtcctt tgccagggct gccatgctgc agaggggagt 54780 

gggccactgt ttagcccagg aaaacctggc tctcccttag ctggaagttc tgggcctgtc 54840 

gcggttggca gggaagctga gtgacggtgc taatcacagg ggcacctgca ggggtttgtg 54900 

ggagatgcct ctgtgggttg gggcgatagg ctgaggggct gttcttccct gccctgagga 54960 

gggctgagtg tagccgccac tcctgtcctg tcttgggctg tctcggagag gatgcgtaga 55020 

accctcggga tcctgctggc ctccgtctgg tccaccctga acctcaggcc ttctgggggc 55080 

agaggaggat tccctcagga tcactcgggt gggggcctct cttgggcacc tgagaccctc 55140 

agtgggtgct ttgtggcgcg ttcacggttg gtgggggacg cccagccctg cccgccgtgt 55200 

aggagccgtt ctgtcctggg catccccctg tggtctggga cttagtggac cctgagggtg 55260 

tgtgtttacc cctgcctcac acctgcaigaa aaaagcctgg agcaggagct ggccagcccc 55320 

atcctggaca ttgaggactt ggtcaagagc ggaagcaagc acaggtgaga cccctcagtg 55380 

aggccacgac cactgtcctt ccatggccca gctctcctgt gacctgtgga ggcccggata 55440 

tatttcttca cttttctttg ttccttttta aattatgaaa ctaaccacca ttcagtacga 55500 

aaaagtttaa gcagctctga ggaagataga gtaaaaaatt gtctccctct tccctggccc 55560 

tcagccatcc ccggtggcca ccgtggagtg tggacggagc cctgcaggcc tgtgtctgtg 5562 0 

cggaagcacg cgcagttttg tctgcacaga ctgtcctgca gttggctgtt ttcactcagc 55680 
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gttgtgggta tagcttccca tgctggtgct ggcagctcgg ccttgttctt ttgaggacag 55740 

cagatgtctc ctatgtcCac ctcttacagc ttcagagatt caagttataa taaagctctt 55800 

cttatattga gggggaaacc tccctccccc ttttttttga aacagggtct cgctctgcta 55860 

cccaggctgc agtgcagtgt cacagtcttg gctcactgca gcctcagcct cccaggctca 55920 

agcgattttc ccacctcagc ctcccaagta gccgggactg caggcacgca ccaccatgcc 55980 

tggttaattt ttgtattttt tgcacagaca gggtctcact ctgttgctca ggccagtctc 56040 

ccgagctcga gagttccacc tgccttggcc tcccaaagtg ctgggattac aggcgtgaga 56100 

ccccatgcct ggccagctct tttttttttt tttttttttt ttgagacgga gtctcgctct 56160 

gtcgcccagg ctggagtgca gtggtgcgat ctcggctcac tgcaagcCcc gcctcccgag 56220 

ttcacgccat tctcctgcct cagcctcccg agtagctggg actacaggtg cccgccacca 56280 

cgtctggcta atcttctgta tttttagtag agacggggtt tcaccgtgtt agccaggatg 56340 

gtctcgatct tctgaccttg tgatccgccc acctcggcct cccaaagtgc tgggattaca 56400 

ggagtgagcc accgcgcccg gcccagctct gctttttctt agtggttctg cgttgtgttt 56460 
gtttctatcc aggaataggg ■ ttggttttac ttttccatcg agtttttaaa gagacgacga • 56520 

tttacatggt cggaaactca cgaggactcc ccatcccttg gtcggaaact cacatggact 56580 

ccccatccct tggtcagaaa ctcacgtgga ctcccatcca tcccaggcag cagcttccca 56640 

cctgggccct acgtgcagga tgagggctcc ttccgggtca gaagacatgg cggcctcggg 56700 

gcaccgtccc ctgcatgggg tgctcacagg atcttctccc ctctccttcc cagggtgtgc 56760 

ccttactacc tgtcccggaa cctgaagcag caagccgaca tcatattcat gccgtacaat 56820 

tacttgttgg atgccaaggt gggggctcag tcctgtagct gacgactcct gatgtccagg 56880 

ggtgtccctg ggcttgggaa cagctgtccg agcctttgct gcttcagggc cttagatcag 56940 

caggcctggg tgggaggact cacctctgtc actgggcagg ggctcaacct ggccagacac 57000 

acttgtgagc agccccaggc cacaggtcag Ctttctgagc agtctgggag cgggcaggct 57060 

ggtgggagtg aggagagacc tccaggctgt ggtccaCagg ccagtgcccg ctcccgatcc 57120 

tgacagctca ggttctctcc ttcacgtcag gccatgggag gcaccgagaa cacaggaagc 57180 

ccactgactc ccctcttccc agcgcgcgcc cggccccaca ctcactcccc ctcccagcat 57240 

gtgcccggct tcacactcac tcccctctcc ccagtgcatg cccggcccca cactcactcc 573 00 

ccccacagca tgtgcccggc ctgacactca ctcccctcct cccagtgtgt gcccagcccc 57360 

actcccttcc gccccgtgtg cccagcccca cgctcactcc ccccgccagc atgtgcccgg 57420 

ccccacactc aactcccctc ctcccagtgt gtgcccggcc ctgctgccct cctccccatg 57480 

tgccctgctt ttgtgcccca cacttcttac ttagtgcagg tgggatcaca cgccacgggt 57540 

caatggtttg tgtgttcacg tgacgatggc gtggtgacgt ttccagatcc cgtcgccggt 57600 

tcgctcattc tcggggtgta tatttattga gagctcatca tgctgggtgc tatcccaggc 57660 

atagcaagac tggcttcact cacatggagc tttgattcta gtggtgggga caggtggaca 57720 

gcaaaagagt aagcacgtga gctgacgata ctgaagggaa atagagcaga gggaggaggc 577 80 

ggagaccgag ccaagcgggc ccaagtgcga tgtcggcggg aggtggggaa tgctggtggg 57840 

tctgagggga gcctcagcag gtgcagcaga gcaagggaag aggtgagtgg gggcggctgg 57900 

ggggccgact cctgggaagc tgtagcagaa ccccacagag agctggtgag gtttgccgtg 57960 

gttgtgggcg actcggtgcc ccgagccctg gctgcccctg ggaaccatct ggagagcttc 58020 

taacccaacc aggcccctcc ctgggacagt tatatcacag ctggtaagcc gagtctaaca 58080 

ctttcacgga aacgcagaag atctaaaaca gcaagatgac cgtgaagaag aacagagctg 58140 

gaggactcac ctcgctggtt tcaagactcc tctaaagctg caggagtgga ggtggagatg 58200 

gcccagctca ggcacaggcc tgcaggccat ggagaaggca gcaagctcaa gctgacccac 58260 

acgcatgtgg tcattgtctt ttttttcagt tggaatctca ctctgtcacc caggttggag 58320 

tgcagtggca ccatctcggc tcactgcagc ccccgcccct aggttctagc gattctccca 58380 

catcagcctc ccgagtagct gggatcacag gcgtgcgcca ccatgcctgg cccttggtga 58440 

ttgttttttg acaaacacgc caatttaatc gagagaggaa atgaaggttg atttctggtt 58500 

ttctgaaaaa atggtgctaa gaacagctgg atatctgttc ggaaaacagt gaatcttaac 58560 

tcttgtttta ccctgtataa acctaaatgt aaaagctaaa ctaaaagtta tagaaaggaa 58620 

catgggggag gtctttgcaa ctttggggta ggcagagatt tcttagtatg gacacacaag 58680 

gcaccagcca tgaagaaaaa cattaaaatt tagacttcac caaaatttaa agcttcaact 58740 

ctgtggaaga gttgagaaaa tgaaaaagca gtcaaagaaa gggagaaaac acttctttca 58800 

aaggacttaa aaaatttttt cagccctcct ctgatttgaa aggacctttg accagagtat 58860 

gtaaaattct cccataacta agcaaacaac ccacccaacc actgggaagg gatctggaca 5892 0 

gacgtttcac caagatgggt ggaatggcca gttaaccact gggagagcat ccggacagac 58980 

gtttcgccaa gatgggtgga atggccagtt aaccactggg agagcatccg gacagacgtt 59040 

tcgccaagat gggtggaatg gccagttaac cactgggaga gcatccggac agacgtttcg 59100 

ccaagatggg tggaatggcc agttaaccac tgggagagca tccggacaga cgtttcgcca 59160 

agatgggtgg aatggccagt taaccacCgg gagagcatcc ggacagacgt ttcgccaaga 59220 

tgggtggaat ggccagttaa ccactgggag agcatccgga cagacgtttc gccaagatgg 59280 

gtggaatggc cagttaacca ctgggagagc atccggacag acgtttcgcc aagatgggtg 59340 
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gaatggccag ttaaccactg ggagagcatc cggacagacg tttcgccaag atgggtggaa 59400 

tggccagtta accactggga gagcatccgg acagacgctt cgccaagatg ggtggaatgg 59460 

ccagttaacc actgggagag catccggaca gacgtttcgc caagatgggt ggaatggcca 59520 

gttaaccact gggagagcat ccggacagac gtttcaccaa ggtggatgga atgaccagtt 59580 

gagcacatgg aaagtcgccc agcatctcca gtcataggag aaggcagact aaagccacgg 59640 

ggagccgaca ctgtggtccc actggcaCgg ccgaaattca gaagccctga gtgtggcatg 59700 

aggatgtgga acagctggac ctcatccatc gctgtgaagt tgtgcagcca ctccacaaac 59760 

gtgtggcaaa cagccgagcc gggagaaggg aagacgtgtc caaagattca tatgtggcca 59820 

ggctcagtgg ctcacgcctg taatcccaga actttagggg ccaaggctgg gggatcgctt 59880 

aagcccagga gtttgagacc agcctaggca acatagggag accccatctc aaaaaaaaaa 59940 

aaaaagaaaa aagaaaagac ttcagtgtgc aggtttacca gagttttgtt tgcagttgcc 60000 

aaaactggga agcagcccgc gtgagcccat ccacaggtga atggacagac cgtggtaccc 60060 

gaacactaac agcagccacg ggcgtggact gtggtcacac agcagcaggg agccgatgag 60120 

tctcggacat gctaacccag agaggcccat tgaggaggac ctactgtttt ttgtgttttt 60180 

gttttttgtt ttgaaatgga gtctcgctct gtggtgcagg ctggagtgca gtggtgtggt 60240 

cttggctcac tgcagcttcc gccccttggg ttcaaacagt tctcctg.cct cagccttccg 60300 

agtagctggg actacaggca cccgccacca cacccggcta atttttgtat tttcagtaga 603 60 

gacggcagtt cgccatgttg gccaggctgg tcccaaactc ctgaccttgt catccactca 60420 

ctttggcctc ccaaagtgct gaggttgcag gcatgaacca ccgcacccgg ctggacctac 60480 

tgttttattc catttatgtg acactctatt aatagaaaag gcaggggtgg ggctggtggt 60540 

tatatggtgc acataactgc cagaactcag tacacttaaa atgaacatct taatgtgtga 60600 

aatttttttt ttcgagacgg ggtcttgctc tgtcacccag gctagagtgc agtggtgcga 60660 

tctccactca ctgcaagctc tgcctcctgg gttcacgcca ttctcctgcc tcagcctccc 60720 

gagtagctgg gactacaggc gcccgccacc acgcctggct aatttttttt ttttttttgt 60780 

atttttagta gagacggggt ttcacagtgt tcgccaggct ggtctcgatc tcctgacctc 60840 

gtgatccgcc tgcctcggcc tccgaaagtg ctgggcttgc aggcgtgagc caccatgccc 60900 

ggccaatgtg tgaaaattta aaagtaccaa agctggaccc caccccagat tgctcccatg 60960 

acactctgtg ggtgggacct gggagttggg ttttgttttg ttttgttttg tttttgagat 61020 

gaagtctcac tctgtcgcct aggctggagt gcagtgacac aatctcggct cacattaacc 61080 

tctgcctccc agatgaaagc gattctcctg cctcagcctt ctgagtagct gggattacag 61140 

gcacacacca ccaccccctg ctaatttttg tatttttagt agagacgggg ttttaccatg 61200 

ttggccaggc tggtcttgaa ctcctgacct cgtgatccgc ccgcctcggc ctcccaaagt 612 60 

gctgggatta caggcgtgag ccaccgcgcc tggctgggag ttgggtttgt aaatctccct 61320 

gagtggggct ggggcaggga actgctgggt ctgggtcttc ctggctcctc tggtctgtgg 613 80 

cttcctgact gcggtggccg ggggctccca gggcatcgtg gccgtctgtc ttgctgagcg 61440 

tggcacgtgc ctttccatgc tgtggaggag cgtctcccgg tatggcgaac tgctggttag 61500 

ggtggggcgg tgttgccagg tcatccaggt ctggcctctg ctctcgacat cgccggcgct 61560 

gttgctcatc tgcgcttgtg atgttcgatg cctgctgcac atgtcttggc ttccctcttt 61520 

cccggcctct gtgagctcca gcgctgcgtc ccttctcttc ctcctgtaga gccgcagagc 61680 

acacaacatc gacctgaagg ggacagtcgt gatctttgac gaagctcaca acgtggtgag 61740 

tctccgctgg cctcctaaac acctcctatt gcttctggcc tttttgtcaa gagccacgca 61800 

aacctttctg gaggggctct ggccaaactc ctgaagccct aggtgcccag gactggggac 61860 

tgagcacacc aggagcttct gccaccccct cccgccctga tccgatgcct ctgctggggc 61920 

tggagactgg ccagctgggc cagggacctg cccgtcaggc gcagggcccc cacaggccgc 61980 

tcaccagacc ctttccctcc agccagctcg gggtcagcct gggccagggc tgtctcctct 62040 

gccctcggca gcagcaggct tgtggtcttg cctgcagtgt ctctgccctt ccggceacat 6210O 

ggcttgagac tgaggcagga gaatcgcttg aaccttggag gcagaggctg cagtgagcca 62160 

ggatcacacc actgcattcc agcctgggtg acaaagcggg attctgtgtc aaaaaaaaaa 62220 

atgttgactg ggcgcgctag ctcatgccta taatcccagc actttgggag gctgaggtgg 62280 

gcggatcacg aggtcaagag atcaagacca tcctggccaa catagtgaaa caccgtctct 62340 

actaaaaata caaaaaaatt agctgggcgt ggtggcgtgt gcctatagtc ccagctactc 6240O 

aggaggctga ggcaggagaa tcactcgaac ccaggaggta gaggttgcaa tgagccaaga 62460 

tcacaccact gtactccagc ctggtgacag agcaagactc cgtctcaaaa aaaataaaat 62520 

caaaaagaat aattggcaat tccagtgaaa taattgtttg tttgtttgtt gagacagggt 62580 

ctccttctgt cgtccaggct ggagttcagt ggtatgatct tggcccactg caacctccac 62640 

ctcctgggct caagccatcc tcccacctca gcctcccgag tagccgggac tacaggtgca 62700 

caccaccacg cccggctaat ttt,tgtattt tttgtagagg cggggtttcc cagcgttgcc 62760 

caggctggtc ttgaacccct gagctcaagt gatctgccca ccttggcctc ccaaagtgct 62820 

gggattacag gtgtgagcca ccgcgcccgg cctgaaacaa tcgtttctaa atattggtgt 62880 

gggccacaca gtcatgtttg gacctacttg tggcctttta cagaccccag gccaaggctt 62940 

tgggaacttg gctgtcagcc tcctgtgcct tctgcacccc caccccattt ctgctttctg 63000 
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gaacccccga tcctgtcctg ttctgtggtg attcgggtgt gcCtgggctc taggagaaga 63060 

tgtgtgaaga atcggcatcc tctgacctga ctccccatga cctggcttca ggactggacg 63120 

tcatagacca ggtgctggag gagcagacca aggcagcgca gcagggtgag ccccacccgg 63180 

agCtcagcgc ggactccccc agcccaggtg cgttcatagc cagactgctt ggtcctgagg 63240 

cctgcgctgc tgcagggtga gccccacccg gagttcagca cggactcccc cagcccaggt 63300 

gcgttcatag ccaggctgct tggtcctgag gcccgtgcta ctgcagtggg cagcctgccc 633 60 

tgtggctgtg tgtggCcggc ctgggcacca tctattcagg ctggcactgc agggcatccg 63420 

cttctcccag aggcttcttg ggtgtgaatt cttcagggtc ccgtagcctg tggaagggct 63480 

ggtattgttc agtagttctg gtattttcca aagacctatg tcttccccca gccagtatca 63540 

acttggcctc taccgtgtaa aactggaaaa ctctactctg tgaagctgag ttgggagcat 63600 

cgcttgaggc caggagtttg agaccagcct gggcaacatg gcggaacctc gcccctgcca 63660 

aaaaattagc caggtgtggt ggtgtgctcc tgtggtccaa gcttttctgg aggccgaagt 63720 

gggaggcgtg ctcgagcctg ggaggcagag cttccggtgc cccagatgac tccactgcac 63780 

tccagcctgg gcggcagagt gaggccatct caaaaaaaaa aaaaaggaaa actaaatata 63840 

ttcactgtaa gggcactttg catccttaaa tgacccacaa atctggcatg catcagctgc 63900 

tctgcctgta ggttccttcc cagcgctcgt ccagaggtgt atttccacac agcgctagtc 63960 

acggcatatg tggaaaacgt ggaaaccctc catggatgtt gtcagttggt ctatatttcc 64020 

tttctttttt ttttttttga gatggagttt cacttttgtt gcccaggctg gagtgcaatg 64080 

gcgcgatctt ggctcactgc aacctccgcc tcctgggttc aagcaattct cctgcctcag 64140 

cctcccaagt agctgggatc acaggcgtgc accaccacgc ccagctaatt ttgtattttt 64200 

agtagagatg gtttctccgt gttggccagg ctggtctcga actcctgacc tcacgtgatc 64250 

cacccgcttc ggcctcccaa agtgctggga ttacaggcgt gagccgccac gcccggcctt 64320 

tgtccatatc ttccacacgg cttctgtaaa cagctgacta ggagtctgtg tgaatatctt 64380 

cataggttct gctgtgacac tacttgctcg tgagcatctc caggtgtaaa cagcatcagc 64440 

ttcccccatt ttcctttaaa atcgcacatg tggacggaca ccacggggac cctggaccct 64500 

ggggagcccc gtcctcaccc ttctcaccag gatggctgct tggtagagag tgagtttgca 64560 

aagttggcat ttgtttagta cagaagttat caggtgttcc ggctttagaa tccctttata 64620 

tatatatata tatacatata tttaagtgac agggtctcac tctgttgccc aggctggaat 64680 

gtggtggtac aatcaaagtt ccctgtagcc tcggcctect gggctcatgg gatcctcccg 64740 

tctcagcgtc ttaaagcgcc gggaccacag gtgtgcacca ctgccaccgg ctctcaagat 64800 

tgccacgcag ggagttgcag tgggggaagg ggttcctggg actttgaacg ctccacctcc 64860 

ctcctctcca cagtccccca accccacctc tctaacgggg tggacggccg cctctttcca 64920 

tccttcgctt ggcgcagggt ggggagagtg acaggtctcc ttccctcatc tcggcagctg 64980 

ccatttcatc gcttacataa cgtgggagaa acatccaccc acccccaggc ctgtgtgaac 65040 

atcaccacgg ggccttctcc actcttcagt tttgttagtc acttgatgtg cagggctttt 65100 

tgttgtaact agtgggggac gtgtggcggg gtgggcttct gccatctcat ccaggaccag 65160 

aacttcagtt ttcatcccta tctgttcccc cacccccttg gagatggggt ctcactctgt 65220 

cacccaggct ggagagcggt ggtgccatca cggctcactg cagcctccac ctcctgcagc 65280 

ctccacctct tgggctcaag tgatcctccc gcctcggcct cccaagctcc tgggaccaca 65340 

ggcgtgtgcc actgCgcttg gcagggtcca ttcttttcct cacactttat ttattgaaga 65400 

gcccaggccg tttaccctgc agagtcggaa tctgtacagg aggggcagcc acacgagttc 65460 

cccggtctac tctgaactta ggtggcttga gggccccagt tagactgcgg ccaccgtttg 65.520 
ccgggctcca gatgggacgt cctttctatc agaaggctca cagtatctcc tttcccgttt ' 65580 

cttcccatgt gaacattgtt gctgctgaac acctgaatat gttaatcact gggggcttgc 65640 

aagatggcag tgtgctaatt ccatcatcta gtcagttagc aggaataact taggaccacg 65700 

ccctgcacca tatcagctat gtggtgatcc cattcacaca ggaaaggtgg gacaaatgct 65760 

gggggtgggc cgggtgtgct gtctcacacc tgtcatccca gcactttggg aggcccaggc 65820 

aggcggatca cgaggtcaga gattgagacc atcctggcca acacggtgaa accccgtctc 65880 

tactaaaaat acaaaaaaat tagccaggtg tggtggtgca tgcttgtaat cccagctact 65940 

tgggaggctg aggcaggaga atcacttgaa cccaggaggc ggaggctgca gtgagccgag 66000 

atcgcaccat tgcactccag cccggcaaca gagcgagacc ccgtctcaaa aatcaatcag 66060 

tcaatcaagt gtcatcactg aatgtttgtg tgtgaacgtg gggatcggtc ctgccccatg 66120 

ctccctcctg aatctcactc ctgacctcag ttgctgcacc ttgaggtgtt ttctgtgggc 66180 

tcttgtgtcc tgaccccggc ggttgtggcc cctgctgtct gggagtcagg atttttcaca 66240 
ctcatgtcct gctceagacc tggaatcagc caagCctcca agaagccctg ctttctttcc ' 66300 

ctgcaagacg gtatttcaag acccgccgtg cggcagcggg ttggtcatgg ttactgggtt 66360 

ggtcgttgtg actgggtgtt ttcgCggaga tacagccata cgcacaggtg tgttcacaaa 66420 

tgCtaattct aaaggtcaaa cacccggcca ggcataaggg ctcagcggta atcccagcac 66480 

tttgggagac caagactggt ggatcacctg aggtcaggag . tttaagacca gcctgagcaa 66540 

cagggtgaaa ccccatctct actaaaaatg cgaaaattag ccgggcatgg tggcgcacac 66600 

ctatagtccc agctagtcgg gagacagaca cgagaattgc ttgaacctgg gacatggagg 66660 
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ttgcagtgag cagagatggc gctgctgcac ccctgcctgg gtgacagagt gacaccctgt 66720 

ctcaaaaatg aatagataaa taaagataaa acacctgctc ctcttggtgt ccccagtttg 66780 

gacttggcct gtgtagcctc ttccttcgcc tgttggtgga tttggcctgc acggattctg 66840 

tgtggcctct tccttcccct gttggtggac ttggcctgca cggattctgt gtggcctctt 66900 

ccttcccctg ttggtggatt tggcctgcac ggattctgtg tggcctcttc cttcccctgt 66960 

tggtggattt ggcctgcacg gattctgtgt ggcctcttcc ttccccCgtt ggtggatttg 67020 

gcctgcacgg attctgtgtg gcctcttcct tcccctgttg gtggatttgg cctgcacgga 67080 

ttctgtgtgg cctcttcctt cccatgttgg tggatttggc ctgcatggat tctgtgtggc 67140 

ctcttccttt ccatgttggt gtcctttttt ccatgccagg aatcctggtt ctcaagggcg 67200 

gggttgccgg cacgagcgtg aCgcagactg cctttgctgc ctttctcttg cccagggctg 67260 

aacatggagc tggaagacat tgcaaagctg aagagtaagt gttgccctcc ccgcctcctt 6732 0 

gcagctgggt ggggcctcct ccttgcgagg aggtgggtga cacctcctcg acccacagtg 67380 

atcctgctgc gcctggaggg ggccatcgac gctgttgagc tgcctggaga cgacagcggt 67440 

gtcaccaagc cagggaggtg agaggcgggg agccagcccc ttcactgcag gcccagccta 67500 

gagctagaaa cgggccatgg tgcagtcctg ggctgtcaca tcacgagtga ggcctgtttt 67560 

caggcctgtt ttcccttttt gagacctggg aggagcacct gctttgcatg atctggttgc 67 620 

tgagatgttg agaggagcag cacacactcc cacgggacag cacacagccc cccacggaac 67 680 

ggcacacaca cccatggaac agcacacaca ctcccacgaa cagcacacac actcceacga 67740 

acagcacaca cactcccacg gaacagcaca cacacccacg gaacggcaca cacacccacg. 67800 

gaacagcaca cacactccca cggaacagca cacacaccca cggaacggca cacactccca 67860 

cggaacagca cactctccca cggaacagca cactctccca cggaacagca cacacactcc 67920 

cacggaacag cacacacacc cacggaacgg cacacactcc cacggaacag cagactctcc 67980 

cacggaacag cacacacact cccacagaca gcacacacac acccacggaa cagcacactc 68040 

tcccacgcgg ggccgctggg tttcctgcag tttctcctcc tccaggcctt tccctggacc 68100 

ctggtccagt ccgtcatttg agcacaggtg cctgttagaa cgagaccttc ttgttaggac 68160 

gatgagtgtc ccagccacca cctcttttgg actccgggag gcctggaacg ttctgaacgc 68220 

tccgtggggc tccagtcttc tccgcagcca gggcagcagg gtttgctgtc tgtcctgcag 68280 

gcagatgagg agtcagggct ggggcctgtg tgggggctct cctgagcgcg cagccgccga 68340 

ggtggagcgt gttctgcctg agcgccgacc tggtcggggg aatcccagtt gctcccaggt 68400 

ggagccactg tcctcagcgt aatgctcaag gctctggcct ggctcctcgg ccaccctgca 68460 

ccctcagggt cccctcctgt agcttctgct gccccatcac tgtcactctc caaagctttg 68520 

gggactctgc ccagagccac cgcctcccag aagcccctga caacctcttg a'cgaccccct 68580 

agtgacccca tccctcccct ctgacggcgg cccctgctct gaggcggctt cttttcctcg 68640 

gtgctgttct cgtgctggcc aggcctccCc tccccacctg gaggctcctg agggcggagg 68700 

cctctcacct ccaatgctgg cgtcccctgg agggctgaat ttgtttccga gggaaggaaa 68760 

cttccacagt tgttgccttc agttccaaag ctgcagcctg atttccccct ccaggctcga 68820 

gcctgttttc ttctcggcag ctacatcttt gaccagtgtc gtcccccctc aggcccgagc 68880 

ctgccttctt ctcctcagtt cccaaagctg cagtctggtc cccccgccag gctcgagcct 68940 

gccttcttct cctcggcagc tacatctttg agctgtttgc tgaagcccag aCcacgtttc 69000 

agaccaaggg ctgcatcctg gactcgctgg accagatcat ccagcacctg gcaggacgtg 69060 

agtgctggca cggggtcttt ggtgcgggca aatgtggcgt agggggtgca gcaggcctcc 69120 

atcttggcag tcagggctcc cccggccgtc acctggccgt cagcaggaac aggcccacag 69180 

aacctcatct tctgatcggg gcgtggaggc gttagtgcca cttgccagct gccgtagagc 69240 

ctgtcccagt tctgcagctg gcggcttcgt cctacagcct catcccatta ttctgctttt 69300 

gagaaagagc agcccaaggc cctagctggc ttgtggggcc tctggcttct ccacaccacc 69360 

ccgagttctg cttctcagag ttgtggggtc cagaggcttt gcccagaggc ggcgtcccca 69420 

tgggctgctc tggtttgaga cgccgggccc agcggggtct ctcctctgct gcgctcccgg 69480 

gtgctgggga gggtggcttt tgctgcttca acccttaggc gaccatagag cctcttttca 69540 

agtcccactg acccccttgg agactctgtc cctgcctggc ttctctcctg gctgctggga 69600 

agagcaggcg aactgcccgc cctgaatgga tgctgcgctc caccctgggc cccccatcgg 69660 

gcaggagatg gagcctggca gtcgggctga gcgggctcat gctggaaggg ccggggccgg 69720 

ggtcggggcc tcccccgcct gcagtgtggg tgtcagcgcc ctgccgccct ccaggtgctg 69780 

gagtgttcac caacacggcc ggactgcaga agctggcgga cattatccag gtggggcctg 69840 

ctcctctgtg gcatctcctt ccctgatgga agccgggcgg gtgccttctc ctgctgtatt 69900 

agttaactga ttctagactt ggggatggga gaaaggcccc tacaccacct gtttctgatt 69960 

ggcaaactct cggctccttt ccagtgccct aaacccacac tgggcctcct gcagggatgg 70020 

gggaggacga ggtctggtgg cacatgccca gggtgatgct ggtgagggag gacgcaaagg 70080 

acagtggggg ccggggagcc gctcctgccc tgtccgggcc ctcaggccag gggggaccca 70140 

ctgctggcag ccccagcagc cccagctgca cgcagatgaa gagctctgga cacacgcggc 70200 

ttcctgaaca gcttctccag ggacagacaa atggggaccc tgcaggttcc cggcaggggt 70260 

gtccctggga gcccatgatt gggggtgcga ccctggcccc cttctcattg gccccgtcct 70320 
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gtcctgcaat 

gctttccctg 

tctgggcacg 

ctgactgggg 

tgcccacgct 

cgctatgaca 

gcagaagaac 

tcccggaaca 

cctgggcagg 

ccactgtggc 

tggggctgga 

atcccaggcc 

cgcatggagt 

tccatgtccc 

tgcagcccag 

cagcttccca 

aatgtggctt 

ccagctatgg 

cccagctatg 

ccccagctat 

ctcccactta 

gcccccactt 

tggaatttcc 

ccctcgggtg 

cccctgcagg 

tcacaggtga 

acagctcagc 

tagctggagg 

ccccgtgaga 

aggcacaggg 

ctggagtgca 

cctcttgcct 

atttttaaaa 

tcctgggctc 

gcttccatgc 

gccctgctca 

gggccaaatg 

gctgggaagg 

cgtgcacatg 

tgccaagtct 

tccgcatttc 

gcggcggggc 

ggctttgcgt 

tcctctccta 

tgaggatccc 

ctctaagttc 

gacacacata 

ggtgaacaca 

ctaaaccggg 

ggcctccatg 

ttcccagcac 

tccactgggc 

tcctggtgtt 

gttgcccagg 

ttcacgccat 

cgcctggcta 

gtctcgatct 

accgtgccca 

ctcgaactcc 

ggtgtgagcc 

tagtgcatct 



gcccgtccca 
gcgtagaggt 
ttgcctctcc 
aagccaaggc 
gtacctgccg 
gcttcatgag 
cctgggatgc 
agcaggcact 
gaccaggtgg 
cgtctctgtg 
aggacagcac 
cagacccctc 
gtggttggct 
tgatgggtca 
ggcctggagg 
ggctcccact 
ctgaggttcc 
agaagactcc 
gagaagactc 
ggagaagact 
aggagaagtc 
aaggagaaga 
tttggctgct 
cctggaggat 
tgtgggcggt 
ccccgggggc 
tggaggcgaa 
caaactctgg 
ctgctgggtc 
ggatttgtgg 
ggggcttgat 
cagccacccg 
tctttttgta 
aggcagtctg 
ctgcccagca 
tggcagccac 
ctttggtaaa 
aaggaaggaa 
catgcagagt 
ctgtgtccca 
tgtgaatgat 
tctgacggcg 
tcctgttctg 
gtgcaggcat 
atatacatat 
tgaggatccc 
ttctttcctt 
tgcttccttg 
tcctgtgtat 
ccttctgacg 
gccacaccgg 
acctgccaca 
ggcctgaacc 
ctggagtgca 
tctcctgcct 
attttttgta 
cctgacctcg 
gcctgatatt 
tgacctcagg 
accacgcccg 
tggagtcagc 



tgtgaggtct 

gcttgtccgg 

ggggcctcgg 

acaggtggct 

gccacacgag 

tgtccatctg 

atggctggcc 

gccttctcct 

ggtgggcaca 

gccagtgacg 

tgccttgtcc 

tgtctgtggc 

cttcaagcag 

ctgcaaagca 

gcaggtgtga 

tatggagaag 

tacactcgaa 

acactctgga 

cacactctgg 

ccacactctg 

tccacactct 

ctccacactc 

gctctaagta 

gagggactgc 

gcccataggg 

agagtggggc 

ctccaggcag 

gcagtgctgg 

ctcatccctc 

tggtgttttt 

cacagctcac 

aggagctgtg 

gagatgaggt 

cccgccttgg 

gaaggctttt 

cagcgtgggg 

acacaaggcc 

cggaaggaaa 

acgcacacac 

gccgtggtgt 

tccggcttct 

gtggcttggc 

gccgtttttg 

cattcctttc 

actctctgta 

atatacatat 

gtctcatgct 

tcagattttt 

tctccaaact 

tcacctgtgg 

tggttctgtg 

cttttcctca 

aatttctttt 

gtggcgcgat 

cagcctccca 

tttttagtag 

tgatccgcct 

tttagtagaa 

tgatcctctc 

gcctcttgtt 

gtgtctactt 



gcttctggct 

tttgtggagg 

tggctttttc 

gtgtggagcc 

atcatggcag 

gcctgtgggg 

tgtgcatgct 

tcacagacgc 

gggtggtggg 

ccacagcctg 

tcccaactcc 

tcctgcctgc 

gaggccgtgc 

cctccagcac 

gctggcccgg 

tctcctccac 

ctgaatcctg 

accgaatcct 

aaccgaatcc 

gaaccggatc 

ggaaccggat 

tggaaccgaa 

gccgctggtg 

acacagtgct 

ctggtgccgg 

cagtggcccc 

ggtcaggccg 

cccgtgttgg 

tctgcctgag 

ttgagaaagg 

tgcagcctca 

aacacaggtg 

cttgccatgt 

cctcccaaag 

cgaaggaagc 

gtgaacttgt 

ctgtgcttag 

tttctctctc 

gcacgcacgc 

ctgccccccg 

tggtgttccc 

tgacatggcc 

gctcgttttc 

ccattgattt 

agttctgagg 

tctctctcta 

ggtcattttt 

gttctgagct 

gcactgtaga 

gggtgtgaaa 

cgctgaccga 

tgttttcgtt 

tgtttgtttt 

ctcggctcac 

aatacctggg 

agacgaggtt 

cccaaagtgc 

atggggtttt 

accttggcct 

cttttgaaac 

cctgtaaaaa 



ccatgcctat 

gcacgcccca 

agaagcagac 

ctgtgaggcc 

ggttaggcag 

tgcttgagct 

gctgggcatg 

age tct gage 

gcccaggctc 

tgtcttctct 

tccccaaagg 

caagggctgc 

acctatcagg 

atggccaggc 

gcctgtccct 

actatggaac 

gaatgcggct 

ggaacgcggc 

tggaacgcgg 

ctggaacgcg 

cctggaacgt 

tcctgcacac 

gatgactcag 

cacccgcgtt 

gttgggcctg 

aggaagagga 

tgtgctcgga 

ggaaccagtt 

gccggagctg 

gtctcgcttt 

acctcctggg 

tgcaccaccg 

ttcccaggct 

tgctgggatt 

tgtttcctga 

tctgttactt 

ctgtcttgac 

cttctgtgcg 

ctgcacaaat 

gtgttctcta 

agcagaactc 

acattgctga 

caggaacggt 

gcagggttct 

atcccatata 

agttctgagg 

tccattttca 

tgtgcctccc 

gtgacggagc 

ggcagactct 

gcggctcggc 

cactgtggtt 

tgagacagag 

tgcaagctcc 

attataggca 

tcaccgtgtt 

tgggattaca 

gccatgttgg 

cccagagtgc 

ctgccctgac 

tcttactgtg 



ggcagcacct 
ttttgggtgc 
tcagaagtcc 
tcctctgtgc 
ggctgcccag 
gggggaggcc 
gagctgcaga 
gggggcgaga 
agccctccct 
gtgcggtagc 
cacggtgggc 
tgtgctgtcc 
cggacctgct 
gaggtagccc 
ccctggaata 
tgaatcctag 
tccaaggctt 
ctcccaggcc 
cctcccaggc 
gcctcccagc 
ggcctcccag 
tccatcggtt 
cttctgccag 
ggctcctgag 
cagccctgag 
tgtgggatgc 
agtcagggct 
gcccctgggc 
ccctgggctg 
gtcaccccgg 
cccaagtgat 
cactcagcta 
ggtctcaaac 
acaggcaaga 
. ggcagactca 
ccatccccgt 
agtgaaaatg 
tacccaggca 
ccacgcatgt 
gttcggcttc 
cctcaagtct 
gcctgttggg 
cgtcacgcgc 
ctgtaagttc 
catattctct 
atcccatgcc 
tgacaggttt 
gaccaagatg 
tttgtgtctg 
accttgattt 
ttcccccaac 
ttgtcgtaag 
ttttgctctt 
gcctcccggg 
cctgccacca 
agccaggatg 
ggcatgagcc 
ccaggctggt 
tgggattacg 
gttttttcca 
attttgacta 



70380 

70440 

70500 

70560 

70620 

70680 

70740 

70800 

70860 

70920 

70980 

71040 

71100 

71160 

71220 

71280 

71340 

71400 

71460 

71520 

71580 

71640 

71700 

71760 

71820 

71880 

71940 

72000 

72060 

72120 

72180 

72240 

72300 

72360 

72420 

72480 

72540 

72600 

72660 

72720 

72780 

72840 

72900 

72960 

73020 

73080 

73140 

73200 

73260 

73320 

73380 

73440 

73500 

73560 

73620 

73680 

73740 

73800 

73860 

73920 

73980 
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gaacgtgttg aatccctgtt tttttcttga gtcagggtcc ctcCgttgcc caggctggag 74040 

tgcagtggga ccatcacagc tcactgcagc ctcaaccccc tgggctcagg ggaccctctc 74100 

agctcaacct cccaagtagc tgggaccaca ggcacatgcc accatgcccg gctaggcttc 74160 

tttttttttt tttttggtga acaccctggg gttgcaccat gttgcccagg ctggtctcga 74220 

actcctgggt tcgggcagtt tgctcctctc agcctcccgg agtgctggga ttacaggcct 74280 

gagccactgc actaggccat gttgaatttc tagattaatt tggggccctc aggggcacag 74340 

agaggagggc tgggccagtt ggcgggagga gaggcccctc gggctgccgc attttcagtg 74400 

catggagatg gcctatgttg ggggaacaca gagctcaccg ggggtccctg cagggaggag 74460 

aaagggtcag gcaggtgcca gctcctgtcc attggcctgg ggctgcatga tggcaggggc 74520 

cggtgaaccg atgacccctg ggtgtcctgt gaccttctgt gtatgcggct gatgctgcag 74580 

aaagtcgggt ggcctcaggc tcctgacggg gctgcacttc ctctgccttt cagattgtgt 74640 

tcagtgtgga cccctccgag ggcagccctg gttccccagc agggctgggg gccttacagt 74700 

cctataaggt aggggccacc tccaggaggc aggtggaggg cagcccttgt tccccggcag 74760 

ggctgggggc cttacagtcc tataaggtgg gggccacctc caggaggcag gtggggctgg 74820 

gggtcttctg gtcctaaaag gtaaggggct gcccccagga catgggcggg gcctccacac 74880 

tcctggtcct gtcccctcca ggtgcacatc catcctgatg ctggtcaccg gaggacggct 74940 

cagcggtctg atgcctggag caccactgca gccagaaagc gaggtacaga cctgggccca 75000 

cacgctcccc gcccgcccgg gtgcagtgcc cggcaccacc atgccacagg ctaggcacat . 75060 

gcccagccgt ggatctcctg cccccatggg cctggccacc ttccccatat ccaggccaat 75120. 

ccagagcatt ctcctcactg tccctctgaa gattggagtt actgagagac gtaggagatg ■ 75180 

gcctgatggc accgtgaccC gcccagagtc acctggttgg tggtggcaga gccacagccc 75240 

agccaggcct ccctgctggg acacgctcgt ttatgccgag gccgtcagca cagagcctcc 75300 

acagtgaggc acggctctgc ctgctgcctc cacgcagcgc ctggccgggc caagcctcag 75360 

ggtcacatct gaagggggcc cggctggccc tgttgtccga agcccctggt gcgctcagcc 75420 

ccgaggcccc acgtgccttc ttggcttcct gtgctccgtg gcgtcttcga gtcggtgctg 75480 

ccggggacgc tgtgtggatg gggtctgtga gtgtgccctc ggctccgtgc ccggagccct 75540 

gtggttcttg gggtgtatct ggccccaccc ccactgcgtg gtgtccaggg Cggggcttca 75600 

cggctgcagc tgcgggagct gctgcccccg ccttgtgctc cagtggggcc ttgcctctgg 75660 

gcttggttcg tccctctctg gaacattctt tctcagctgc tgtccgaccc atggtggcat 75720 

gacgtggccc tggctgaagc agcccttgtg cggttgctgt ggttgggtct gcctggccga 75780 

gccggaaggg aagggctggg agggcgtcag ggtggcgtgg cttgaccccc gctcggtgat 75840 

ggccctgcag caaggcctct cccagcagga agcgtccatc ccggggggag gccggcgccc 75900 

ctcacgcagt tggggttgcg ggaggcagtg cgtgcctgag gcagccggtg cacagattcc 75960 

aagggcctgg aatctgtttg ttccattgac ctctgatgtc acttgacttc tcagaagcag 76020 

ccactccctg cactgggcgt ttgtaggaaa tgagctcctg gaggaggggg tggggaagtt 76080 

cccccattgc agggcacact cagccccagg aaggaaacgt gcctcgtccc tgctgactcc 76140 

gaatcgcagt cagagtcgtt ctgcttgtgc cgtgttgaat tcccggcatc cggcatccag 76200 

actcagcctc ctccccaggc cacggccgcc gtggccagtc ggtcaagccc ttctaggaac 76260 

ttcctttgag ctggcgccct tgttcactgc tgacgccact cagaggctcg tgcacgtgtc 76320 

ctgcttccag gcagagctgg gaacccgcac cccgtcctcc gcacgcggcc gtggaatgtc 76380 

gggatgccgg cgcttccttc ccgtgtgctc ttggcggggt gggcttcttg ccccgagccg 76440 

cacgtcacag tttctgcaga agtttagggt tggagtgggc tgacctctct gcaggtgtcc 76500 

ccagcctctg cctggggtct gcctcctact cccaggaccc cctgtccccc agaggggccc 76560 

caagctggca ggctcacact cagggcagcc tcctttgctc tgacttctgc acagtgggcc 76620 

tgggtggctg cccgcggctc ' gcttgcttga tgccagtggg tggagagggt gatgggcaga 76680 

gaggcaggtg gtcaggcccc cagtcccgtc ctcacactct gtgccctctg ccgccccccg 76740 

ccccacaggg aaggtgctga gctactggtg cttcagtccc ggccacagca tgcacgagct 76800 

ggtccgccag ggcgtccgct ccctcatcct taccagcggc acgctggccc cggtgtcctc 76860 

ctttgccctg gagatgcaga tgtacgggcc acccctgcca gggcccgagc accggtgaca 76920 

cctctgacat cagcggggtg gaagtggtgg gggtccccat gagccgggtg ctgggggtct 76980 

cgggcctcga gggctaaagg ggtgctggtg cacttcccca cCgtctgctc cctctggcca 77040 

cgctcagccc tttcccagtc tgcctggaga acccacacat caccgacaag caccagatct 77100 

gggtgggggt cgtccccaga ggccccgatg gagcccagtt gagctccgcg tttgacagac' 77160 

ggtgagggcc tgtccctggg ccctgctggg gtgggaggtg ggggagcact gaggcctgag 77220 

gccctgagca gtggcctctc cggctctagg ttttccgagg agtgcttatc ctccctgggg 77280 

aaggctctgg gtgagtgccc tgaatgcccc agctgtgcgc atcctggatc ctggacccct 77340 

gctcccaaga gctggtaggg acccctgcag acatcctgcc cccgecttga ccccggcccc 77400 

tgcacttcca ggcaacatcg cccgcgtggt gccctatggg ctcctgatct t'cttcccttc 77460 

ctatcctgtc atggagaaga gcctggagtt ctggcgggtg cgtctcccct gtgttctggg 77520 

cggggtgggt gagggcaggg ctggagcatg aagcaggcag tggtcacagc tcctgcttgc 77580 

cctcatcgga tcggcggcgt gaccagggct gccgtgcccc tgcctcttcc tcccacaggc .77640 
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ccgcgacttg gccaggaaga tggaggcgct gaagccgcCg tttgtggagc ccaggagcaa 77700 

aggcagcttc tccgaggtcg gcacCtggcc ggggctctgg gcctgctgcc ccctcgCgcc 77760 

tcccctgcct ctcacagctt ceccaaggct gaccactggc cctgaccatg ggctccggcg 77820 

gctcccgctg ccCcttcagg gctcctgcgt ttccttcctg gccctgagtg ttgcctctta 77880 

tcttacaaag cccccagcac cgggtgggtg tggcaacagt ggccctcctg cctgagtagc 77940 

cccagtcggc caccctggcc ctggggttcc ccgtgttttc tgggaagcac tgagcaggcg 78000 

tggggtcagc ctgggatccg tgccaggaag aagctcccag aacccgattg gccttcctgg 78060 

ctaggacgat ccttcatctt ggagcatgag acctgggtct ccctcatggg ggaggaaggg 78120 

gctggggggg ggctccaggc tcagcctcac caactttcct tccagaccat cagtgcttac 78180 

tatgcaaggg ttgccgcccc tgggtccacc ggcgccacct tcctggcggt ctgccggggc 78240 

aaggtgagcc ctccagggcc ctctgccccg acctggctgc ctgttccctg gtgggtgctt 78300 

atggctcccc agcagactct gggccctggg ggctgcccgg tcccctcctt gggtcccacg 78360 

agagcgactg ctggccctgc tgggagcgtg tcctgctctg ggcctgggca ggcaggatgg 78420 

gagtttcctg gccacaagag ttggaggtgg cgtctgggag ctgtggaccc caagtggggt 78480 

cctgacccac agatggagct tcctcccacc cctggCtggg gacggagcct cggggaaggt 7854.0 

ggctgggctg ggtgtgggca ccagggagag gagcccccac ggccccaggc agctccctgg 78600 

tgtgtcccct aggccagcga ggggccggac ttctcagaca cgaatggccg tggtgtgatt 78660 

gtcacgggcc tcccgtaccc cccacgcatg gacccccggg ttgtcctcaa gatgcagttc 78720 

ctggatgaga tgaagggcca gggtggggct gggggccagg tgagttacag cagggcgggg 78780 

ctggggtaag gcggtctggt gactgagccc ccgccccgtg gccaagggag cccccgtgac 78840 

cgagccgcct cgccccacag ttcctccctg ggcaggagtg gtaccggcag caggcgtcca 78900 

gggctgtgaa ccaggccatc gggcgagtga tccggcaccg ccaggactac ggagctgtct 78960 

tcctctgtga ccacaggtgc gtgcagtccg gtggcaggcg cggcgccagg ggacacgccc 79020 

acaccccact gggcccctgg actctccttc cccacatgag gccccgtctc ctccagagcc 79080 

tctccggcta ctcggggtca gcgtggggcc cctgcagcag atgagggtct tcacttcggt 79140 

gaactgaacc cttgaagcgg ctgtgggcag ggcagcaggg ctatggccac cccccaggtt 79200 

cgcctttgcc gacgcaagag cccaactgcc ctcccgggtg cgtccccacg tcagggtgta 79260 

tgacaacttt ggccatgtca tccgagacgt ggcccagttc ttccgtgttg ccgagcgaac 79320 

tgtgagttcc cgcccaggga ggggacgagg gtgttgtccc cagaggagcc agaaatgggt 79380 

ccacccaccc ccatggttct gcagatgcca gcgccggccc cccgggctac agcacccagt 79440 

gtgcgcggag aagatgctgt cagcgaggcc aagtcgcctg gccccttctt ctccaccagg 79500 

aaagctaaga gtctggacct gcatgtcccc agcctgaagc agaggtcctc aggtgcggac 79560 

gggcagcgct gggtgggcgg tgtgggggtg gcggagcggg cggcgtgggg cgggcagcac 79620 

caggcgccca gggcggaggc gactcacctg gctttgtgcg cttcccctcc cacctccaaa 79680 

ggctgcctct ccctcctagg gcagggcccc cacgggctgc aaccctcccc Cacaggcaga 79740 

gaacgcccca ggcaaggatg ccccccgagg ctgagactcc ccccaatagc agggaggaca 79800 

cccacaggca ggaccccaag tgctgggact cccccccaag aggggctttg ccacaggcag 79860 

ggaccccagc tggggccccc cgtgggcttc actgcgcact cgggtgcccc tgcagggtca 79920 

ccagctgccg gggaccccga gagtagcctg tgtgtggagt atgagcagga gccagttcct 79980 

gcccggcaga ggcccagggg gctgctggcc . gccctggagc acagcgaaca gcgggcgggg 80040 

agccctggcg aggagcaggt acagttccag ggccttggga tggacacaga ccctctgtct 80100 

cctgaggcca acccgacccc gcccatctgg cctcaggcac ctccccacac acccctgtaa 80160 

atcccctgcc tggcaggcag gcgggcaagc gggcggggga tcccagccgc ccggctgtct 80220 

gtgggtcctc caccccacct cacccacagg ctgctggctc ccaggtggtg catgccctgg 80280 

ccctccgcgg gtgcccccca catcactttg gttctctggc gggtcagctt ggctcagtgc 80340 

actcaaggtc gggtgcccct gccactggct gcgcttgagg ctggcctttc tccaggaatg 80400 

tgctgcgggt ggaacccagg ttccttcttc ctcggggcct tttgccccag aagcccataa 80460 

ttcctcaggc caacccgaaa ttttctccct gcttcctgct gggagccatt cccctcttcc 80520 

Cgcccatccc tgcccttcag gcccccggag tgagctccag gtgcaggcac caggcacctg 80580 

tgtccccttc ctgccagccc ctcgctgtgg tcggactgtc ttccctggac ctgctcttac 80640 

aagtcaccac ctgcgagcct catgagccgc tggtgtgact tggacaggac caagttgtgg 80700 

cactgtcacc ggggtgtgct gtgcccccct cccccgacct ccatcttggc tcagggctcc 80760 

ttgggaccat cttccctgtg cgtccaggtg ctttgggacc ccagagtgtg tggttggggt 80820 

ctgtgtgtgg ttgtgagctg tgtcctcctc aggcccacag ctgctccacc ctgtccctcc 80880 

tgtctgagaa gaggccggca gaagaaccgc gaggagggag gaagaagatc cggccggtca 80940 

gccacccggt gcgtgagccg tccctgcacc tgtgccgacc accatagaca cgcatgggaa 81000 

cgcagccgtg ggtgccccca gccacggctg gtcccgaCgg gaccagggaa tccaccccca 81060 

ggagccgatg tccagggcag ctgtgatgct gacggccagg ggcccaagtg tgtggtttct 81120 

tctgcagggg gctcatgagt cccagctgga atcaggcccc acccttgggc aggtttggca 81180 

tggggcctgc agcactgggc ttggccctgg catttccctc aagtgtggat gcacacctgc 81240 

ctcatgtgag ggacacagcc cattcctagc cttggatcaa agaacggagt tatagccgga 81300 
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gccaggaagc cccctgcctg cCggaaaacc ccaagtgtgg cggcctttgt ccatgtccct 81360 

tggcttctgg gaagaactgg gtggtgccca ggcagggctg gtgccatcag gaagtgggtg 81420 

gctgctgagg ggcctgggct ggcgagggcc tgggtgggga gtgcctgggc cgcccctgcc 81480 

ttggtttcca cgtttccgtg ttggtctggg gtgtgtagag agatgggcac tgctcatccg 81540 

gaagcccctc cttgtgcgct gccatcccgg gagcctcagc cgcatccgcC gtggggcagg 81600 

gggcttgagg gaggaggaga gagacgggcc atgcaggacc cctggcttga ggcagagcca 81660 

atctaccctt tgcccattca ctgctctcag ttccctgcca gcctctcact gtgtgacctc 81720' 

agacgggccc agccccacag ctttcttccc gcagcccctc cctatgtcca tccagccagc 81780 

cagtttctca ggcagcagcc ccacctcggc agtcactgtc ccagggaacg ctcaatgttc 81840 

caaggaaggc tctgcagccc cagggaccag atgatgaggc tggccctgat ggagcctcgg 81900. 

gcctgtgtcc cgcaggagga gcccgtggcc ggtgcacaga cggacagggc caagctcttc 81960 

atggtggccg tgaagcagga gttgagccaa gccaactttg ccaccctcac ccaggccctg 82020 

caggactaca agggttccga tgacttcgcc gccctggccg cctgtctcgg ccccctcttt 82080 

gctgaggacc ccaagaagca caacctgctc caaggtgccc tggcttgcag aggccaccca 82140 

ccctgagggc agtgctgccg ccgcgtgtgg ggtgggggcc atctgggtcc aaggtggtcC 82200 

ctgttctcta gagaaaaagg ggcagatggg gacagacgcc ccttcctcta caggcttcta 82260 

ccagtttgtg cggccccacc ataagcagca gtttgaggag gtctgtatcc agctgacagg 82320 

acgaggctgc ggctatcggc ccgagcacag cattccccga aggcagcggg cacagccggt 82380 

cctggacccc actggtaaat ggggccccag gtgggaccct cagactcctg cgtggaaggc 82440 

agtgtgggcc agagtcctgg gctgcttggg gtgggcaccG tcgggccctg cctggccccg 82500 

cctctctgtt cccctatggg agtgatgggg gcctccacct ccaccaccag caccagcagc 82560 

accacctcca ccttcaccac caccacctcc accaccacca cctccaccac ctccacctcc 82620 

accacctcca ccacctccac cacctccacc accaccacca cctccaccac caccaccacc 82680 

accacctcca ccaccaccac caccaccacc acctccacct ccaccacctc caccaccacc 82740 

tccacctcca ccaccaccac cacctccacc tccaccacct ccacctccac ctccaccacc 82800 

accacctcca ccaccaccac caccacctcc acctccacca gcagcagcat cacttgttgg 82860 

ggagaccctg tgcaactcca tgcacagccc tgtccctgcc atagccccga cccctaagca 82920 

cagccctgtc caactgccac acgtcccctg cctcccatgc atggtcctgg ggggtcaact 82980 

gcacacgcca gggtcctagg gtcctagacc cctgtcctcc ctgtttctgc ctctgtttgg 83040 

ggtggagtcc aagtctccag aggcggaagc atctgtgttc gtgtgttaat gaacagcccc 83100 

tacagagttc ccctagttca cccagggggg aacctagcct gttgggacga ccccagatcc 83160 

cttctgggct tggtactcac tgggatatcc tcatgcctgc acccagccta cggctctgag 83220 

ctcctgagtg gggctttggc ctgcccgcca ctgttccagc ccccatccag caggctggtg 83280 

tctcctctga tgcccccagc acccaggcgt gtacctgcct gggttttccc gccctggtct 83340 

gaggtgggtg aggcctggcc tccctagcca gccctgcccc cccaccccag ggaactttcc 83400 

agatgctccc gaccagcttt gtggctctac atctcttcat caggaagaac ggcgccggat 83460 

cccaagctga ccgtgtccac ggctgcagcc cagcagctgg acccccaaga gcacctgaac 83520 

cagggcaggc cccacctgtc gcccaggcca cccccaacag gtagctgact cctgaaccgt 83580 

gtgcagccta cgacttggtg ggtccctcag tggcttcacg aggctaactc ttgagtgtgg 83640 

ccggggctgc ccctgtgggg agccatctca tggtggggac tgctcccggt tctgcacccc 83700 

gcagttgtcc tgagcagctc tccaggagtt cctggaggaa gggcgggcag ggcggtggga 83760 

ctctcagtcc tccaccccag cgccactctg agccatgcta ctcccacacc aggagaccct 8382 0 

ggcagccaac cacagtgggg gtctggagtg cccagagcag ggaagcaggg ccagcacgcc 83880 

gtgagcgcct acctggctga tgcccgcagg gccctggggt ccgcgggctg tagccaactc 83940 

ttggcagcgc tgacagccta taagcaagac gacgacctcg acaaggtgct ggctgtgttg 84000 

gccgccctga ccactgcaaa gccagaggac ttccccctgc tgcacagcaa gtggccctgg 8406O 

cgtggggaac agccggtggg gtggggggca ggggacaaaa tgggggctgt gccgggtctg 8412 0 

attgaagctc cccgcagggt tcagcatgtt tgtgcgtcca caccacaagc agcgcttctc 84180 

acagacgtgc acagacctga ccggccggcc ctacccgggc atggagccac cgggacccca 84240 

ggaggagagg cttgccgtgc ctcctgtgct tacccacagg gctccccaac caggtagggc 84300 

acctgcctgg ctgctcctgg cagcgcccca accgcacgca gccctgggag tgagcagcaa 84360 

agccccaggc ccccctcaga ctcaagtctc tgtctccagg cccctcacgg tccgagaaga 84420 

ccgggaagac ccagagcaag atctcgtcct tccttagaca gaggccagca gggactgtgg 84480 

gggcgggcgg tgaggatgca ggtcccagcc agtcctcagg acctccccac gggcctgcag 84540 

catctgagtg gggtgagcct catgggagag acatcgctgg gcagcaggcc acgggagctc 84600 

cgggcgggcc cctctcagca ggctgtgtgt gccagggctg tggggcagag gacgtggtgc 84660 

ccttccagtg ccctgcctgt gacttccagc gctgccaagc ctgctggcaa cggcaccttc 84720 

aggttggtgc ctggccacta cagttcctgc tgggtgtagc cccaggtgat gggctgaggg 84780 

ggaaagggca ggcccttgtc ctggtggcaa cgcctggcag acgtgtgcag tgggccggtt 84840 

gtctcacagg cctctaggat gtgcccagcc tgccacaccg cctccaggaa gcagagcgtc 84900 

atgcaggtct tctggccaga gccccagtga gtgcccacgg aggcccccag cacacccaac 84960 
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gtggcttgat cacctgcctg tccagctctg gcgggccaag aacccaccca acagaatagg 85020 

ccagcccatg ccagccggct tggcccgctg caggccccag gcaggcgggg cccatggttg 85080 

gtccctgcgg tgggaccgga tctgggcctg cctctgagaa gccctgagct accttggggt 85140 

ctggggtggg tttctgggaa agtgctcccc cagaactCcc ctggctcctg gcctgtgagt 85200 

ggtgccacag gggcacccca gctgagcccc tcaccgggaa ggaggagacc cccgtgggca 85260 

cgtgtccact tctaatcagg ggacagggct ctctaataaa gctgctggca gtgcccagga 85320 

cggcgtctcc gtggcctggg cttggtggtg ggagttgagg gacagggagt tggcagaggc 85380 

ccctcccagc ctgccatgtg acactgtact tcctccacgg tgggctcagc cctgccctca 85440 

tcctcacagc cgcagccaag ctgcagttgg taggggatcc accgacacac caggctgcct 85500 

gggctggtct ctgggttggg agctgcccca ggtgctgagg agggcagctc cctggctggt 85560 

gaggcccctc ccagaaccac ccttggactg agctctgggg agggatggta ccaggtgggt 85620 

gaggggggct gcctggggag ggaggggttc ctatggggcg tggcgaggct ggcccagccc 85680 

tctccccgcc catatatgta gggcagcagc aggatgggct tctggacttg ggcggcccct 85740 

ccgcaggcgg accgggggca aaggaggtgg catgtcggtc aggcacagca gggtcctgtg 85800 

tccgcgctga gccgcgctct ccctgctcca gcaaggacca tgagggcgct ggaggggcca 85860 

ggcctgtcgc tgctgtgccC ggtgtcggcg ctgcctgccc tgctgccggt gccggctgta 85920 

cgcggagtgg cagaaacacc cacctaccGC tggcgggacg cagagacagg ggagcggctg 85980 

gtgtgtgccc agtgcccccc aggcaccttt gtgcagcggc cgtgccgccg agacagcccc 86040 

acgacgtgtg gcccgtgtcc accgcgccac tacacgcagt tcCggaacta cctggagcgc 86100 

tgccgctact gcaacgtcct ctgcggggag cgtgaggagg aggcacgggc ttgccacgcc 8 6160 

acccacaacc gcgcctgccg ctgccgcacc ggcttcttcg cgcacgctgg CCtctgcttg 86220 

gagcacgcat cgtgtccacc tggtgccggc gtgattgccc cgggtgagag ctgggcgagg 86280 

ggaggggccc ccaggagtgg tggccggagg tgtggcaggg gtcagg'ttgc tggtcccagc 86340 

cttgcaccct gagctaggac accagttccc ctgaccctgt tctcccctcc tggccgcagg 86400 

cacccccagc cagaacacgc agtgccagcc gtgcccccca ggcaccttct cagccagcag 86460 

ttccagctca gagcagCgcc agccccaccg caactgcacg gccctgggcc tggccctcaa 86520 

tgtgccaggc tcttcctccc atgacacgct gtgcaccagc tgcactggcC tcccccCcag 86580 

caccagggta ccaggtgagc cagaggcctg agggggcagc acactgcagg ccaggcccac 86640 

ttgtgccctc acCcctgccc ctgcacgtgc atctagcctg aggcatgcca gctggctctg 86700 

ggaaggggcc acagtggatt tgaggggtca ggggtccctc cactagatcc/.jcaccaagtc 86760 

tgccctctca ggggtggctg agaatttgga tctgagccag ggcacagcct cccctgggga 86820 

gctctgggaa agtgggcagc aatctcctaa ctgcccgagg ggaaggtggc tggctcctct 86880 

gacacggaga aaccgaggcc tgatggtaac tctcctaact gcctgagagg aaggtggctg 86940 

cctcctctga catggggaaa ccgaggccca atgCtaacca ctgtcgagaa gccacagggg 87000 

gaagtgaccc ccttaacacc aagtcaggtc cggtccatct gcaggtccca actcgcccct 87060 

tccgatggcc caggagcccc aagcccttgc ctgggccccc Ctgcctcttg cagccaaggt 87120 

ccgagtggcc actcctgccc cctaggcctt tgctccagct ctctgaccga aggctcctgc 87180 

cccttctcca gtccccatcg ttgcactgcc ctctccagca cggctcactg cacagggatt 87240 

Cctctctcct gcaaaccccc cgagtggggc ccagaaagca gggtacctgg cagcccccgc 87300 

cagtgtgtgt 'gggtgaaatg atcggaccgc tgcctcccca ccccactgca ggagctgagg 87360 

agtgtgagcg tgccgtcatc gactttgtgg ctttccagga catctccatc aagaggctgc 87420 

agcggctgct gcaggccctc gaggccccgg agggctgggg tccgacacca agggcgggcc 87480 

gcgcggcctt gcagctgaag. ctgcgtcggc ggctcacgga gctcctgggg gcgcaggacg 87540 

gggcgctgct ggtgcggctg ctgcaggcgc tgcgcgtggc caggatgccc gggctggagc 87600 

ggagcgtccg tgagcgcttc ctccctgtgc actgatcctg gccccctctt atttacccta 87660 

catccttggc accccacttg cactgaaaga ggctcttttt taaatagaag aaatgaggtt 87720 

tcttaaagct tatttctata aagctttttc ataaaactgg ttgtagttgc acagctactg 87780 

ggagggcagc cggggacacc tgagccgccc gctgtgccca gatccctcag gctgcctgcc 87840 

atcagaactg ctgcccgggg cttcccctac ctcagacaga ccctccctgg gaggatcagt 87900 

ggggagtgcc acctctgccc ccagtggctg tggcacgtgg caggggcccc tgaagctcag 87960 

cgagggtcag ggcctgggag ggtatcattg ctggaagaac aggatggggc tcaggccagc 88020 

cctagtcgcc ggggcccaca ctaacccccc acttatgaat tcctcccact cccaactcac 88080 

aggggatttc ccgagagggg acctgccaaa gacctcctcc aggcctccca tgctccccgg 88140 

gaagtgaagc ttctccccct ctggggcagg ctctgaagcc tcccgatgca cccagagcaa 88200 

ccagggggct gcaccagcca ctcgcctccc cagcacggcc aggttcccgg ggctggaggt 88260 

cccccccagg tcctgggaac caacctgcag aacacacaca gggtcccctg gagaggacgc 88320 

ggggactccc agggcccgac tcctgtgagt cacagccccg cagctgctgc gccaccccca 88380 

ccctgactca tgccccttcc cagcagctcc tcccaggacc ccatgtcctt cccacatccg 88440 

caggaaggga gtgcctggac tctccaggcc cacctgggga gcccctcacc tgcccaccag 88500 

cccctgagca gcccagtaac accatcaccg tgtccaacag ccaggagcct ccaccctcca 88560 

ggagggaagg gatggacaga gccacactcg ccgtctttat. tttgcactca ccctgggtga 88620 
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cactgggcag gccgctcctg cccacagcca gactgaggaa gaacacagca cCcggcaggc 88680 

ccagcggggt ccgtgcaggg aggaccccag gaccagcctt actcccgagc aggggacaca 88740 

gggccccaca gagaacccct ccgggaggtt ctctcccggc tgggggaggg ctctggaccc 88800 

ccacaaacac tccccaacct gcggggcCgg ggcataaaaa cagccactcc cagcaggccc 88860 

cctcagcttt ttgcatcagt cagctccctc ccgggggatt agggtgaggt gaagccaggc 88920 

ccaggcgtgg ggtataggtc ttcccccgca ggcctcagcc ctgtcccgag gctgcaEcac 88980 

aatccagggc ccccgctggc ctttgggaac atggcctggg tcttcctcaa ggcaagatca 89040 

gccccagacc acttccgggg tcacggggcc acagggcaga agccagatgg cagccatggc 89100 

tgacgggcct cctcctcgat ggggcggaga cagccacggg gtctcccgag ggtcccacag 89160 

ggctgtcctc atgcagccca agccagcctg agcactggag ccccaattcc caaccaggtc 89220 

tccctcagac cccccagaaa gggcctcgaa aggccgccgc. tgcgccctgt ggaaaggctg. 89280 

ccgctgcagg gcctgggcca gccgggctgc cagacccccc tccaaagcct ccggatgcct 89340 

acgcttttcc agacatagag gaaagtttgt cttcgagaaa acaaagtaaa tagaagaacc 89400 

ccaaagcaaa gcaaacccac cccccagatc agcagcatgg gagccaacag gaggccactc 89460 

ctccagcacc aggggaccag ccgtcccgac ggcagcgcgg ctgcgcctac gtgatgtccc 89520 

tctgccgcgg cggccggtgc acatcccgca cgacacactt caecatccac tcgacgccct 89580 

cgcgcacccc tttgctgtga agacagcggg tgtgaggcgg ggggtctcgg tccccaaagc 89640 

ccccgcaggt gcagccccca ctcaccctgt: gagggccgag caggcctggg tcaggcaaCc 89700 

gcgcctgccg atcttgctgg tgcagtcgct gaaggccgtc ttgatgtcag ggattgagag 89760 

gcacgtctgg gggaggtaag gccgtgagga gcagccccca cgtctggccc tgtcctgcct 89820 

gtgggcccgg gactctcaga agggcgtatg cccttcaccc cagggaaaca gccagagctc 89880 

caccagggtc ccagtgtctc ccacagagac cacagcagtg aggaccccgt gctcagcccg 89940 

aggctgaaca tggctggtag tgcctgagac aaactagacg tccacacggc tccaaggagC 90000 

ccacccccca tcccctccct gggggacacc ctgagccccg aggtggggcg ctgaggactg 90060 

aggcctcctg ggcagtggcg gaggcaggtc ccaggggccc acacagccgg ggatgatgga 90120 

gaggtgggag ccctgcatca gtgatggggg cagtctgcag Ccatggtggc ttctgctcac 90180 

aaccacctgc ccagtcttca aaaagcagcc ctcccctccc cttttccCcc gaggggagac 90240 

ccctgccccg taccagatgt ccctcttgtc ggctgagatt gtaggggagg ccagccttac 90300 

aggctggggg caacagagcc accccagaga aggcaggaag tgaagattca cccggccctc 90360 

tggacgccgg gctgcttctg tgcaaagcca ctccaagaga acagctagaa ctcagcgtgg 90420 

ccagtgctcc cgggggcagt ggcacctcag aggggtcttg aggggctgcc ctgggggtgg 90480 

ggctggcaca gatgccacct ccaagggtag caggaacagg taagggtcag agctgactcc 90540 

caccagggcc ccagcatcac ttctttgagc tctgagtttc acctgggtgC ccccacagcc 9O60O 

tggccacaca ctcctgagac acggccgccc tcctggggag aggtgccctg catagcagga 90660 

agaggcctcc gggcgcctgc cctgaggtgg gagaacctcc agggctggca gcagcaggtc 90720 

tggagaggaa ccaagcttgg gaagctgctg ggggcagggc aggccttgag aatggctctg 90780 

taccccctgg gcagtcacCg ggcctggggt gtctgggtgc acacctactc cccttgctgt 90840 

gggggaggct ggggactcgg gaagctgctg cgggaggcag gggtggggct cacctccaca 90900 

tcctgcttgt tggccagcac caagacgggg acaccgcaca gcgcctcgct ggtcaccacc 90960 

ttctctgggg agggcaggag aggcagcgcc tcacacccag catcctgcct ctgactgccc 91020 

aggggcccac aggcgtggac actgtgacag ccactccctc tgcccccccc ccgtcaccca 91080 

ctaggcagga gcacttctga ccagacactg agcctgcccc aggcacagag ctgcccaagc 91140 

tggacctgcc cccactcacc atccatccct cccagagcag ccaggccgca ctcaccaaac 91200 

gcctgcttgg actcagccag cctctcctcg tcggtggagt caatgacgta gatgacgccg 91260 

tgacactccg cataatactg ggaggaagca ccaggagttg gggctcagtc cccaccctgc 91320 

caagggccag cagagccagg cctgtgtcat ggccacagtg aggggctcac atgaggaagg 91380 

ggcaagaggg cagcccccaa ctgcaagacc cttctgggat gcattctggg gttgcgggga 91440 

gatctggtgg aggtgtcccc agacgctgct cctgagaacc tgccggcaac ctttggcctg 91500 

atggtggcca aaggtgaaag acagggattg ggccaggcgt ggtggctcac acttactatc 91560 

ccaacacttt gggaggcaga agcaggagga tcacctgagc ccacttcacg gccaacctgg 91620 

gcaacacagt gagactccgt ctgtacaaaa gcttatggta atgtgcgcct gcagtcctag 91680 

ctactcggga ggctgaggtg ggaggatggc ttgagcctgg gaggttgagg ctgtagtgag 91740 

ctctgatcac accactgcac tccagcctgg gtgagaatga gagaccctgt ctcaaaaaaa 91800 

agatagggtt tgggggctgg aggaacctag accacagcct ggcccgttga gggagtgcac 91860 

ctgtggggct ctgtgccagc acctcgcaca gggagggagt gtggccatgc ggataagact 91920 

gaccagcacc atctacgaag cgagccttcc ctgccaggac agggccagag tcactgagct 91980 

cagacctctg cagcctgggc tggtcagtcc tgggctcgct ggcaacactc ctgggcaaga 92040 

cagggcacag cccctgcagc ctcaggtaca agtgctgagc cctggaccag atgagtgcac 92100 

ctctatctca atcagaaaaa aacacagcaa actccgcgtc cacgtggagc agacaacagc 92160 

tcacatttgc cactttgcct ccaggctgtg ccagctctcc tgtccaggca tgagtgccca 92220 

gagacctaga actggatgct gaccaggtag gacaagctgg tggtcagtgt gttaagacac 92280 
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acacacccga gagcatgaga agccaggagg cacagcccaa ctctccgaaa tccttagggt 92340 

gtctgagcag ggagtaccag acaaccccat cccagtgcca gacaagcttg tgcacctgca 92400 

cttcccacag aggagagaag cctgtgcacc tgcacttccc acagtggaaa ggaggaggcc 92460 

caaggccagg cccccccacc cccaggaact tcccacagtg gagaggaggc ccaaggccag 92520 

gcgccctcca gggttctgca ggtagcgagg cccccccacc cccaggaact tctctggcct 92580 

acagacaggt cccacacaga ggccgccaac ccctcaaggg accctgcagC gtgccggctg 92640 

tctgctgctg acacaaggga gcaggcggac cctaaggtgg agacctctgt ggcaggaggg 92700 

gcggctctgt ggaggctgca gcaagcccag tgagagaatc tccacgtggc tcctggggct 92760 

tctgagcagg gtggcagaag gttcatgtgc aaccgggtcc tggaccatgg gaccacgCgg 92820 

ccagagccac ccatcacacc taccaggcac aaggtgcaca gcccagcagg gccgcagtgg 92880 

acgggagcga cacctcaggg ctgagtgcgg gcaggaccca gagccccacg ccccagtgga 92940 

ggcgtcacag cagtggtcat tgtggggtgc cccacaagga gggggaagag ggaggtgtcc 93000 

cagcgtggct cctggctggc cagctgaccc eagtggagca gtcagaggga ctgtgggtct 93060 

gagtttttct ccccagcagc aatgggagct ccccaactgc aaagtgccag ccagcctgag 93120 

agactagtgt Cacagcaaag aacccaggag ctgaggtccc ggcacatgcc acacatgtgg 93180 

acaccaaccc agggtccagc cccaggacga ggccaattcg caaCgacgcc cctttctgtg 93240 

gtgctggccc tgcacaagga tgcaggatac aggaaccagg gtgggagcag gggcctccct 93300 

tccggtccct cccagtgacc taggggggtc cctgcagctg atcctcccag ctctgagctc 93360 

agcagggtca ggggtcccgg ccaccagagc agcacatact cagcagacac gctgaatgac 93420 

gagccacagc tgcctcatgg gcatgacttg cacctcatgt ctaggagacc ctggtgggca 93480 

ggagatgggg ctgccatccc acagctgtcc cacagctggg gacccaggga gccactggcc 93540 

ccaccacggt ggtgtctgga gaagggctca gactgccagg aagtcgcacc ccagcagaag 93600 

tggtagtgaa ttgggagggc actcaaggaa gggctgtgca gccccaagac cagcagcaag 93660 

gatgggctac agCggccccc ttaagtctcc ctcttccagt ttcgccttaa gagaggccct 93720 

caggaccttg gaggaacccc tctccaacgt ggaagtgtgg gcccacatag ggctgcagcC 93780 

gtggccagtg caggcatctc tggccccact gtattcttgc ttcatgttgg agaacactgc 93840 

accagcagat ggtctcatct tggtttctgt gggacccact ttggctgcaa agagccacac 93900 

tgccaggtca cacctgccca gggcagccca cactggggac ccaccaggcc atggtgtgaa 93960 

gtcccggcca gcctggcccc acatggcaca gcatagccag ttctcctcca gggctccctg 94020 

ccgggccaac cacagctctg cggatcctgc cgcctgagtc gacctctcct ctcccgtcct 94080 

ccctgccttc ctggtgccga cccccagtgt gcatccCgta cctcgacctg cctcagcatc 94140 

tgtgcctgag acaccggccc gtgacaagat catcatcatc tgtgtcactc cccaagcatg 94200 

ctgcgcactg gacacacagg ccctgactca acttgtcctg tctgacttca gtggtcctac 94260 

aggatctatc agagatcact tggccatggg agaaatgtct tcttggcCag aagtcacagc 94320 

aggaggggac actttggggg cgcctaggaa aggggaacta ggatcaaaaa agagatcagg 94380 

acctgggcac tcagctctag agatggcatc agggcagcca aggcactggg gacaccccac 94440 

acccactgtg ccagcctagg gcagggagcc cgaggaagcc acaggctcCg ccctgctcag 94500 

tgctggactc agtgcctggc ccaggctgag aaggagataa actgcagcct tgggggtgtg 94560 

gggaaggggc accacactgg gatctcagaa atgcccaaaa cctgtgtcaa aataggagac 94620 

tgccgctgtg agaccctgag gagtcttctg gtgatcatgg aagaacaaat gttaagctag 94680 

aactgaagga acctcatcag gggagaggca gccatcctgc cgtccccaca tctggtcttt 94740 

gccatttctg tgtcctgtgg tggtcagcag caaggtctct gagccgaaag gaggcactca 94800 

ctttggagga gtgcagggtc cccaggtccc cacactttgt cttgtcctga ctgagaaaga 94860 

aacagactgc cctgacctct ctgacttggc cagcgaggtt gcccttaggc tcaaacccaa 94920 

gccagggttt gaacatcccc agacactcgt aagatgttta ggtcgttaac ataatgttca 94980 

ggtttcaaaa cactgaaaga aactagcccc agccctgaac ccagatcccc cccggcttca 95040 

ggcatgacca gcgaacacgc ccctctctca ctggtcacct gaggatgccg cactctgtca 95100 

acaggttccc ctaatacatg cCctgatctg atcgccttgg catttagtga ttctttccct 95160 

ggaattctcc actggcccca tcgcagggaa ctcccaagtg ggaaactccc ctaccaccac 95220 

ttctggggca acttcagcta agggttcagc tgggacaaaa cagggagcca ctcgggaacc 95280 

tgggacagga ccagagagaa aacccgaggg acagagtggg taaggaaagc tgccgaggaa 95340 

gggcccaaag ggcactctgg aaagaagtgg cactggaggg ctggggtggg ggtggtcctg 95400 

gccagggagt cttaccttgt cccacaaaga ctgcagctct tcctgccctc ctaagtccca 95460 

gaacatgagc cgagcctttc ccacatccac agtgccgact ggggagagga ggaaacaggc 95520 

aaggctcatg accttggtcc tcgacacacc cagtcccagc tctcccaggg gatggggcaa 9558 0 

accacgctgg tgccactcaa atgagacttg agaggggccc gacagggctg tggccacggg 95640 

ccagctggac tgtgaatatc' acggcatcct caaggcccca aacccacagc ctgctatcga 95700 

gacccttact gtttaggccc acggtggtgg tgattttgga tagactcatc cccttgtagt 95760 

ccttgctaaa tcgggttttc gactgctcca ggaaggtctg aggagagagg cagaggcgaa 95820 

acacatcaag gaggggctat actggcttcc aaatatcctt actcaggtct gttctttaaa 95880 

agacagaaac agaaacagag caacactctg ctcttcagga ggctggtggt gactatcctg 95940 
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ccgtctcagg tgaaatttgg cttccgtctg ggtagtgaac gtgcagctga cagcacaaaa 96000 

ccgaaggggg cgccgccagg ccgcgggaaa ggtgcgcgca agggcgtggg cactcaccgt 96060 

cttcccagca ttgtccaggc ccaggaCcag gatgcagtac tcgtccttct gaaacatgta 96120 

cttgtacaag cccgacagca gcgtgtacat cctgccctgg gcaccccaac acaggtcagt 96180 

gtgcagccag aaagcacctc ccctcccccg ggcttctcca cggtggtcag tggcgcccca 96240 

cgtccagccg accgctcagg acgagagcct gggggccatt cccgactcct cgtccctctc 96300 

ccaccccgtc cctctgtaac ttctcccagg tcagccgcca ctgcgtcctg ctcacagcaa 96360 

tgactgcgac ctctccgcat acacatcggt tccggcccct cccctgctcg cgggactacc 96420 

cagccgggtg ttcacagtga gctcagccgc gctcccgccc tcccccgagg cttcgctccc 96480 

acgcttcacg cgcgcggaac ggggaacaca ctcgctgcag ccccgcctgg gccacggcac 96540 

cctcgagcgc cagccccgcg ccccacccgg gagcagcgag ccaccggcgc gctccccagg 96600 

agcccctgca ggcgccgggc agggacgccc caccacccca tttcttaaaa cggggacggc 95660 

- cctgggggga gcggactaca gggcgggtga gcagcggcgc ggctgctcct ggagtgcacc 96720 

tggaggcggc gcgcggctgg cagggaacga ctgcgaagga agaacctggg tcgcggcccc 96780 

cggctacgtc cgccccaagc cgccgccgcc aggtctgagg ctccccgaca agcagccaaa 96840 

gctggctcct gtcacacccg cgtcccacct cgagtcctgg gccgcccctc gggcctcgcg 96900 

cctcaccgca cagcctgcgg cctacctgcg tccgccgcgc cctcggagcc gctgctgctg 96960 

acccccgctg acctccgctg accccgcgct aaccccgcgc ggcgcctgac gggacgcggg 97020 

ccggcctcag ggaatgagct gaaccgcgtc ccagcggcct ccgcgctccg ctCcccggct ■ 97080 

gcccccgcgc gccaagcact tccggaagcg gcggcgctcg ggaggaagtg ccgatcggct 97140 

gctggggcga aaagggggcg ccgggccgct ctagccggtg aggccggcgg gctctctgtg 97200 

gctgcggctg ggaaaccgcg cggaggaggt gcccggccgg ggaccaggtg gccgcggttt 97260 

gcggggacgc ggccctggcc agacagaaga gacgccgggc gggggggcgc ggccggcctg 97320 

gaaggcggcg ggcgcggcgg gtgggctcgg cggagggtga ggcggcgggg cgccccgcgg 97380 

ggaaggggct ccggagtgac gcgggacccg gctagcggcg agcccacggc ggctcggaag 97440 

ggaagcgcgg agcctgagcg ggggtacccg ggctgcgacc tctgcgctgg gagctgtgcc 97500 

Cctgagccgg tgcctccccg agggaaaggg gacgtgcccg tgcccgtgcc cgccctcagg 97560 

ctgtggggtc ggtcccgaga cgcggggctc agctggctcc tcttcttgca gccctggtcc 97620 

agcgcctccc tctctcagca tggacgagga gagcctggag tcggccttgc agacctaccg 97 680 

tgcgcagctg cagcaggtgg agctggcctt gggcgccggc ctggattcgt ctgagcaggc 97740 

tgacctgcgc cagctgcagg gggacctgaa ggagctcatc gagctcaccg aggccagcct 97800 

ggtgtctgtc aggaagagca ggttgttggc cgcgctggac gaagagcgcc cgggccgcca 97860 

ggaagatgcc gagtaccagg ctttccggga ggccatcact gaggcggtgg aggcaccagc 97920 

agcggcccgt gggtccggat cagagaccgt tcctaaagca gaggcggggc cagaatccgc 97980 

ggcaggtggg caggaggagg aagagggaga ggacgaggaa gagctgagtg ggacaaaggt 98040 

gagcgcgccc tactacagct cctggggcac tctggagtat cacaacgcca tggtggtggg 98100 

aacggaagag gcggaggatg gcccggcggg tgtccgCgtg ctttacctgt accccactca 98160 

caagtctctg aagccgtgcc cg.ttcttcct ggagggaaag tgccgcttta aggagaactg 98220 

caggtaaagc cctttgttgt cagatgccaa ccttaggggc gtaaggggca cgcacacagg 98280 

gtcgggtcag gatcggccct ccctttgctt tgcagttttg tctcagcttc ctggggcagg 98340 

cgtgctttga cagctgtgtc cgtgctcagg cgcctacgtc ttccttctgg ggtgaatcaa 98400 

gaagcatgga aggaggccag gcgcggtggc tcacgcctgt aatcccagca ctttaggaag 98460 

ccgaggcggg cagaccacct gaggtcagga gttcaagacc acgctggCca acatggtgaa 98520 

accccatctc cttaaaaaca caaaaatgaa ccggtcgtgg tggcgcgcac ctgtggtcct 98580 

ggctactcag gaggctgagg caggagaatt ggttgaaccc aggaggccga gtttgcagtg 98640 

agtggagatg cagccactgt actgcagccc gagcagcagt gcaaggctta tgtggaagag 98700 

agtaggtctc cagcctatcg tcagtttttt tttggtggtt gttttaattt tttttgagac 987.60 

agggtcttac tttgtcaacc aggctggagt gcagtggcat agtcctggct cactgcagcc 98820 

tggacctcct gggctcaacc gatcctcctg cctcagcccc cctaggagct gggctacaga 98880 

ctcacgctac tacacccagc taatttttat attactataa ttttttatct tttttttgag 98940 

acggagtctt gttctgttgc ccaggctgga gtgcagtggc gtgatctcgg ctcactgcaa 99000 

gccccgcctc ccgggttcac gccattctcc tgcctcagcc tcccgagtag ctgggactac 99060 

aggcgcccgc caccatgtct ggctaatttt ctgtattttt agtagagacg gggcttcacc 99120 

atgttagcca ggatggtctc aatctcctga cctcgtgatc cgcccacctt ggcctcccaa 99180 

agtgctggga tgacaagcgt gagccaccgc gcctggcctt ttttttttgg agacagagtt 99240 

tcactctcct cacccaggct ggagtgtagt- ggcgcaatct cagcttaccg caacctctgt .99300 

ctcccgggtt gaagtaattc tctacctcag cgtccagagC agctggcatt acaggcgccc 99360 

gccaccacac tcggctaatt ttttgtattt ttagtagagt cggagattca ccatcttggc 99420 

caggctggtc ttgaactcct gacctcgtga cccacccacc ttggccCccc aaagtgctgg . 99480 

gatcacaggc gtgagccact gcgcctggcc ctgttgttag ttttattctc tagagttcaa 99540 

cttttaaatt ttactttcat ggagattttc aaacataccc caaattagag agtttagcat 99600 
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aatcaccgcc 
tttgtctgtt 
aagttactgg 
aggaaattac 
gtaacttctg 
atgaacaatt 
tttttacttt 
tagccaatag 
gtcctccttt 
gcagtggcac 
cctcagcctc 
tattttttgt 
gatgagagat 
ttcatatact 
ctgacagagg 
ggctccagtt 
tcggcatcag 
attcaggact 
agtttcactg 
agggattctc 
gtgcccagct 
gtctcgaacc 
gcatgagcca. 
aagtatatag 
tgtcctttga 
gcctatgttt 
cctacgtttt 
tttgagttaa 
atttccagtt 
tggcactttt 
. tcctgttcca 
ataagtttta 
ttttgtttct 
cataaaagac 
acattgtcat 
cccagtactt 
ctggccaacg 
gcgcacacct 
gaggaggagg 
gagactccaC 
tcacgaatga 
actgttcaag 
aattgttttt 
agtgcagtgg 
ctgctcagcc 
tttctttgag 
ctcactgcaa 
ctgggaatac 
gggtttcacc 
cggcctccca 
ccttagtcac 
ggctcacgcc 
ggagttcgag 
ttagtctggc 
aattgcttga 
gcttggggaa 
tttttatgtt 
atgcactatt 
tatatgtgaa 
ttcttgctta 
cgggccgggg 



cacggtccat 
ctccctgccc 
ttttttccaa 
ttaaaaagta 
tgtcataata 
attactgtaa 
cctcttagaa 
ttcaggctga 
tttttttttc 
aatctcggct 
cctaatagcc 
agagacggga 
ctgacacatc 
ttcattcatc 
aggaaacagg 
gcccccagcc 
gcctgatctg 
gataagttct 
ttgctgccca 
ctgcctcagc 
aacttttaca 
cttgacctca 
ccacacccgg 
tttgcagata 
Cgcacaaaca 
ttgtcatcag 
ctcctaagag 
tttttgtaCa 
ttcccagcac 
gtccaaaatc 
ttggtgtata 
aactcaggaa 
ggatcctttg 
agtttgaatt 
ctttacaata 
tgggaggctg 
tggtgaaacc 
gtaatcccag 
ttgcagtgag 
cttaaacaac 
aggatctgtc 
tcttgcacat 
cttttccttt 
cacaatcttg 
tcccaagtag 
atggagtctt 
gctccacctc 
aggtgcctgc 
atgtagccag 
aagtgctggg 
attctcaggc 
tgtaatccca 
accagcctgg 
atggtggcag 
acctgggagg 
caagagtgag 
aatcctgtat 
taggattttc 
tagagataat 
actgccttag 
gtggtggctc 



cacccaatgt 
tatttttttc 
ttatgaatac 
actttttgga 
ggtaacacat 
tgcctctatt 
gaaatattta 
ccgtgtaacc 
ttgaaacagc 
cactgcaatc 
gatactacaa 
tttcaccata 
cctgtggtgc 
tgtgcaacag 
agtagaagaa 
ctcgttccgt 
aaagcttccg 
cttggacctg 
ggctagagta 
ctcccaagta 
ttcctagcag 
ggtgatcccg 
cccaggattt 
tttgctccca 
ttttaaattt 
gtttaagaaa 
ttccgtagtt 
tgttattagg 
catttgttta 
aattggcaat 
tgtgccagta 
gtggtagtta 
caatttcata 
ttgatatgga 
ttaagccttc 
aggcgggcat 
ccgtctctac 
ctacaggaga 
ctgagatcat 
aacaataaca 
catttattta 
tcttggttaa 
tttttcttga 
gctcactgca 
ctgggatcac 
gctctgttgc 
ccgggttcac 
caccacgccc 
gatggtctcg 
attacagatg 
tgtttgttgc 
gcactttggg 
ccaacatggt 
gtgcctgtaa 
cggaggttgc 
acttcatctc 
tgattgctga 
gatatataca 
tgcagtcttt 
ctagaacttc 
acacctgtgt 



cgtcactcat 
tttcctgggc 
ttcatagttg 
agtccaagtc 
ttaatggtaa 
tccctaagcg 
ccaagccttc 
atccctagtt 
gtcttgctgc 
tccgcctccc 
gtgtgcactg 
ttacccaggt 
tccctctgga 
ccctgtaggt 
ctgagtggtc 
cccctgctcc 
gagcatctta 
cgttggaccc 
caatggcacg 
gctgggatta 
aggcagggtt 
ccttggtttc 
ctttatatat 
ttgtttgggt 
tgatgcagtg 
ccaccccatc 
ttagctgtta 
tgagggtcca 
aaagactgct 
atatgtaagg 
ccacactgtt 
ttcaccattt 
tgaattttag 
ttgcatagaa 
tggctgggtg 
atcacttgag 
taaaaacaaa 
gggtgaggca 
gccactgcac 
gaagaaaaaa 
cgtctttaat 
ataagtatta 
gacagagtcc 
acctccgcct 
aggtgcctgc 
ccaggctgga 
accattcttc 
agctaatttt 
aactcttgac 
tgagccactg 
tagtatatag 
aggccaaggc 
gaaatccagc 
tcccacctac 
agtgagctga 
aaaaaaaaaa 
attggtttat 
atcatatata 
gtttccagtt 
aagtacgatg 
tcccagcact 



caatatttcc 
cattttaaag 
catctctaat 
cgatgtgagg 
tacctcttcc 
catagcttta 
tagtaggtaa 
ctagttctag 
tctgtcaccc 
tggcccaagc 
ccacgcccag 
ctcgaattcc 
ccaggcactg 
aggccctgca 
cagggcttca 
acccagtgct 
cagacgtcca 
tttttttttt 
acctccacct 
caggcgcctg 
tcaccgtgtt 
ccaaagtgct 
tctggatatc 
tgtcttttca 
caatttattg 
catagttatg 
aatttaggtc 
ctttattctt 
ttttctccac 
gtttatttct 
ctgattatta 
gctcctcttt 
gatcggcttg 
tgtgtagatc 
tggtggctga 
gtcaggagtt 
aaacaaatca 
ggagaatcgc 
tccagcctgg 
acagtattaa 
ttctttcaac 
tttttgatgc 
tgctctgtca 
cccgggttca 
caccacaccc 
gtgaagtggc 
cgcctcagcc 
ttgtattttt 
ctcaggtgat 
tgcccggctc 
aaatacaagc 
gggcggatca 
ctctattaaa 
tcaggaggct 
gatcgcgcca 
aaaaagaaat 
tagtgctaat 
ttcaatatat 
tgcatggcat 
ttgaataaaa 
ttgggaggtg 



ccagtctcat 
caaatcccag 
ctaactgatt 
acaaaaaaga 
atattcaaat 
tttttcctcc 
tttcctttct 
ttctttgaat 
aggctggagt 
catcctccca 
ctaatttttg 
tgatcccttt 
ctccaagggt 
gtcacaccat 
aggctcagag 
gcttgccatg 
ccttgccacc 
tttaagatgg 
cctgggttca 
tcaccacgtg 
ggccaggctg 
gggattacag 
atcccttatg 
cttgatatag 
tttctttatt 
aggattttca 
ctcgatccat 
ttgcatgtgg 
tgaatggtct 
gagctctctc 
tagcttcgtg 
tccaagtttg 
tccaattctg 
tgtttggggc 
cgcctgtaac 
caagaccagc 
gtcggaggtg 
ttgaacctgg 
gtaacagagg 
gcctcccaat 
agtattttgt 
ttctctaagg 
cccaggctgg 
agcaatcctt 
agctaatttt 
ccaatcttgg 
tcctgagtcg 
agtagagatg 
ctgcctgcct 
gagttgtttt 
tgggcaccgt 
cctgtggtca 
aatacaaaaa 
gaggcaagag 
ttgcactcca 
acagtggatc 
aggatttttt 
acaattaata 
ttatttcttt 
gtgactagag 
gaagtgggca 



99660 
99720 
99780 
99840 
99900 
99960 
100020 
100080 
100140 
100200 
100260 
100320 
100380 
.100440 
100500 
100560 
100620 
100680 
100740 
100800 
100860 
100920 
100980 
101040 
101100 
101160 
101220 
101280 
101340 
101400 
101460 
101520 
101580 
101640 
101700 
101760 
101820 
101880 
101940 
102000 
102060 
102120 
102180 
102240 
102300 
102360 
102420 
102480 
102540 
102600 
102660 
102720 
102780 
102840 
102900 
102960 
103020 
103080 
103140 
103200 
103260 
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gatcacttga gatcagcagt ttgagaccag 
ctaaaaacac aaaaattagc tgggtgaggt 
agggtgagac atgagaattg cccgaacctg 
tgccactcca ctccagcctg gacgacagag 
aaaagtggct agaacaaaca tctttatctt 
ttcctgatac taggtggaaa actttcagtc 
gttttctgta gattcccttt atcgagttga 
gtgtttttta tcatgaaagg gtgttgattt 
cccaggctgg agggcagtgg catgatcatg 
ggggatcctc ctacttcatc ctcctgagta 
cagctaattt tttaattttt ctgtagaggt 
taaactccag ggctcaagca atcctgcctc 
gagtcactgc actgcaccca gccgtgtggg 
tcatgtgtgg tttttttcct ttcattttgt 
accatccttg aattcctcag ataaagcacg 
catctttttt gtagagatga gatttcactc 
ctaaagtgat cctcctgcct cagcctccga 
ggccctatct tttttctttt tctttttttt 
ccgggctgga gtgagtggcg cgatctcggc 
caattctGCt gcctcagcct cctgagtagc 
gctaattttt tgtgttttta gttgagacgg 
cactcctgac ctcgCgattc acccaccttg 
agccaccgca gcgagcctta tcttttcaac 
aacattgctg gatttgattt gctgctgttt 
gggatattgg tctgtagttt Ccttttcttg 
taatattggc ctcatagaat gagttaggaa 
agggattggt gttaattctt cttcaaatgt 
acaatcatat atatagagag agagagagag 
attgactatt gacccaattt ccttattgaa 
ataactgtag tgaaagtttc tgaacttttc 
Cttatttgtt ataggcctat tcagattttc 
ctgtcgccca ggctggagtg cagtggtgcc 
agttcaagtg attctcccac ctcagcctcc 
catacctggc taatttttgt atttttagta 
ggtctcgaac tcctgacctc aggtgatcca 
acagacataa gccaccgcgt ccagccttga 
taggaattcg tccattttgt ttatgttatc 
tccttttata gccctagtta tttctgtaag 
ggttttagca atttgagtca tctcctttct 
cactgtgtca. cccaggctgg agtgcagtgg 
cccaggttca agcaattctg ccttagcctc 
ccacacccag ctagttttgt tttgtttttt 
ccaggctgga gtgcagtggt gcaatctcac 
caattcCcct gcctcagcct cccgagtagc 
gctgattttt atacttttag tagagatggg 
actccctacc tgaggtgatc cgcccacctt 
gagccaccat gcccagccag tttttgtatt 
ccaggccggt cttgaaatcc tgacctcagg 
taggattaca ggcatgagcc accacgcatg 
aaaggtttgt caattttgtt gatctttttt 
tttcatttat ttccatttta acctttgttt 
ctcttttttt cccctaattt ttcaaggtat 
tttcttttct tttctttttt tctttttttt 
tgtcacccag actggagtgc agtggcatga 
gcgattctcc tgcctcagcc tcctgagtag 
atgtaagcat ttacagctac agatttccct 
attgtttttt gttgtttttt gttgttgttt 
ccgtttggag agcagcgatg cgatcatagc 
gtcctcctgc ctcagcctcc caagtagctg 
taatttcttt tataagtttt tgcagaggcc 
cactttggga ggccaaggtg ggtggatcac 



cctggccaac acggcgaaac 
gatgtgcacc tgtagtccca 
gggggcggag gctgcagtga 
caagaaccct gtctttaaaa 
gttcctgatc ttaggtggaa 
tttcactgtt gaatatgatg 
ggaaattctc ttatattcat 
tttttttaaa gatagggtct 
gctcactgca acctcgaatt 
ggtgagacta caggcatgag 
agggtcctgc tttgctgccc 
agcctcccaa agtgctgaga 
atttttcaaa tgcttttttc 
taatgtggta tattgatttt 
catattcatg gcgtattatc 
tgttgcccaa gctggtctca 
aagcgctggg attataggca 
ttttttttga gacagagtct 
tcactgcaac ctccatctcc 
tgggactaca ggtgcccgcc 
tgttttgcca tgttggacag 
gcctcccgaa gtgctgggat 
agttaaaagt ttaaggcctt 
tgttgagaat atttgcatct 
gcatgtcttt gtatagcttt 
gtattcttta tattatggga 
ttgatagaat tcaacagtga 
agagatggac ttttcttttg 
attgactttt ctttttggaa 
tttcattgga agttttttga 
tgtttcttct tgagtcagtt 
atcttggctc actgcaactt 
ccagtatctc ggactacagg 
ggaacagcat ttcaccatgt 
cccgcctcgg cctcacaaag 
gtcagtttag atagtttgca 
taatctgtta ccatacaatt 
atcagtagta atagctccac 
tcttctttct ttttttttga 
catgatcttg gctcactgca 
ctgagtagct gggattacag 
tgtttttgag acggagtctg 
tcattgcaac ctccgactcc 
tggaactata ggcgtgcacc 
atttcaccat gttggccagg. 
ggcctcccaa agtgctggga 
tttagtagag atggggtttc 
ttatccacca gccttggcct 
gcctgtcttt ccttcttggt 
gttgctgatc tctattgttt 
ccttttttct gctggtttgg 
acagttaagt tattgatttg 
tttggttgct gttgagatgg 
tctcagctca ctgcagcctc 
acgtttcccg gccaaggtgt 
ctaaacactg ctttcactgc 
tgttgtttga gacacagtct 
tctgtagcct tgagctcccg 
ggactacagg tgtacaccac 
aggcacagtg gctcacacct 
ctaaggtcag gagctcgaga 



cccatctcta 103320 

gctacttgag 103380 

gccaagatca 103440 

aaaaaaaaaa 103500 

aacttttttg 103560 

ttacttgtag 103620 

agtgtgttga 103680 

tgttctgtca 103740 

cctgggctca 103800 

ccaccatgcc 103860 

aggctggtct 103 920 

ttacaggggt 103980 

ctttagatga 104040 

cgtatgttga 104100 

tctttattat 104160 

aactcctggg 104220 

tgagccactt 104280 

cactctgtcg 104340 

cgggttcaag 104400 

actatgccca 104460 

gctggtcttg 104520 

tacaggcatg 104580 

atcatgtaat 104640 

gtattgataa 104700 

gatgccagca 1047 60 

agaggtaaaa 104820 

agtgatatat 104880 

ttggaagttt 104940 

gctaaaatgt 105000 

ctactgattc 105060 

tggtctcgct 105120 

ctacctcccg 105180 

cgcacgccag 105240 

tggccaggct 105300 

tgctgggact 1053 60 

tgcatgtttc 105420 

gttcatagta 105480 

tttctctctt 105540 

gatggagtct 105600 

acccctgcct 105660 

gtgtgagcca 105720 

tttctgtctc 105780 

cagattccag 105840 

accacgcctg 105900 

ctggtcttgg 105960 

ttataggcat 106020 

tccctgtcgg 106080 

cccaaagtgc 106140 

cattttcgct 106200 

tcccattctg 106260 

gtttaatttg 106320 

agatctcttt 106380 

agtctccctc 106440 

cgccgcccag 106500 

ttctttttga 106560 

attccataag 106620 

cactctgttg 106680 

gactcaatca 106740 

tgcacctaac 106800 

gtaatcccag 106860 

ccagcctggc 106920 
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cgacagggag 

tgcctgtaat 

agaggttgca 

ctccatctca 

caggccggtc 

gagattatgg 

cattttcatt 

ttattcaaga 

tttttttttt 

cttgcctcac 

agtagctggg 

acgtgctggt 

ccaaagcact 

atagagagaa 

tccctcctcg 

tttcaagact 

cactttggga 

acatggtgaa 

gcctgtagtc 

gagcttgcag 

ccgtctcaaa 

tgagaaattg 

ttgctgctcc 

gtggatccct 

atgcctttca 

tttgtttctt 

tggtgtagca 

tcctcacttt 

ttctcaaatc 

gctccaaagt 

tttggttaga 

ttaaatttct 

gtcacccagg 

tccaagtgat 

cacccaccta 

gtcttaaact 

cagatgtgag 

gtttttgtct 

tattcttaaa 

ctaggctgga 

gcgattctcc 

ggctaatttt 

gatctcgtga 

gagccaccgc 

gctgtagtgc 

gtctcccacc 

ggttgtgttg 

ctcctgggct 

agccaccgca 

ttctttgcat 

ttactttgaa 

ttgtcgtttc 

tcaggggtgc 

cctttaaaCa 

atgctaggac 

gcctggccag 

tggcacctgc 

agggggcgga 

gcgagactct 

ccaggttgcc 

cctcagcctt 



aaaccccatc 

Gccagctact 

gtgagccagg 

agaaaagaaa 

tcaaactcct 

gcaagagcca 

gatctcaaga 

gtgtctttat 

gagaaagggt 

tgcaacctct 

attacaggca 

cagactggtc 

gggattacag 

ggcctcactc 

gcctcccaag 

ttaaaaatgc 

ggccgaggtg 

accctgtctc 

ccagctactc 

tgagctgaga 

aaaaaaaaaa 

gctgttaatc 

aggagactct 

caatttataa 

ttaaatttgc 

tctgtctgtt 

caggtccctt 

ggataaattc 

tgctgttgaa 

ttctatttgg 

catggtcctt 

ttttatttag 

ctggcaggca 

tctcctgcct 

atttttggta 

actaacctca 

ccactgtgcc 

agtcattcaa 

aaattttttt 

gtgcaatggc 

tgcttcagcc 

ttgcattttc 

cctcatgatc 

gcccagccta 

agtggcacaa 

ttagtctcct 

ttactgtttt 

caagcagtca 

cccggcctgt 

gccttgtaat 

aatcagatcc 

tcgtttgttt 

ttgtttctgt 

cctagtggca 

tttgggaagt 

tatggtgaaa 

ctgtagtccc 

ggttgcagtg 

gtctcaaata 

tctgtactgg 

Cacttctcgc 



tctactaaaa 

caggaggctg 

atcacaccac 

aaaaaaagct 

gacttgaagg 

ccgcaccctg 

catcctaacc 

ttctgcatat 

cttgctctgt 

gcctcccggg 

cacaccacca 

tcgaattcct 

gcgtgaaaca 

aagcccaggc 

gtgctgagat 

catcttggct 

ggcagatcac 

tactaaaaat 

gggaagctga 

tcacaccact 

aaaatgtcat 

ttattgagga 

ctatccctgg 

gccacttgga 

aaagtttcag 

tgccgcttat 

gggctctagg 

agctgacctg 

acttctggtg 

tttctttctg 

tcgtcttcct 

ttattaattt 

acgtcacaat 

cagcctccca 

tttttagtag 

ggtgatctgt 

cagcctcctt 

atgtctaggc 

taattttctt 

ttgatcttgg 

tcctgagtag 

agtagagaca 

ctcctgcctc 

ttttttattt 

ccctggcaca 

gagtagctag 

agacacaggg 

tcccaccttg 

taatttcttt 

ttctcgttga 

tccgccctct 

agttactttc 

tcttttaggt 

aaaaggataa 

ggaggcgggt 

ccctgtctct 

agctactggg 

agctgagact 

aaaaaaaaaa 

ggcaagtcct 

tcccatggag 



atacaaaaat 

aggcaggaga 

tgcactccag 

tttgcagaga 

agtcctactg 

ccacttggct 

tcccttttgt 

ttgtaaattt 

cgcccaggct 

ttcaagtgat 

cacctggcta 

gacctcaggt 

ccatgcccag 

tggtcctgaa 

tacaggtgtg 

gggcacggtg 

gaggtcagga 

acaaaaaatt 

agcaggagaa. 

gtactccagc 

ctcactgcct 

acatttatat 

gtttcagcag 

gttcattgga 

ctactattct 

ggaactctat 

catttttctt 

tcctcaagtt 

aaattttcac 

tagtaattat 

ttagttcatt 

tttttttttt 

cttggctcac 

agtagctggg 

aaactgggtt 

ccgcctcagc 

ttttagtgta 

ttcctcagga 

ttttttttag 

cccactgcaa 

ctgggactat 

tggttttgcc 

ggcctcccaa 

tttgagacaa 

ctgcagcctt 

aactacaagc 

tcttgctaca 

gcctcccaag 

atttccggtg 

aacctgcaca 

gcagggttca 

ctgacctttt 

tagtggttag 

agaggccggg 

ggatcacttg 

accaaaaata 

aagactgagg 

gcgccattgc 

aaaaaggata 

tcagtgtccg 

cctaaggatg 



tagctgggcg 

atcgcttgaa 

cctgggtaac 

cagggtatca 

cctcagcctc 

gttttgttct 

ttttttgttc 

tccaaaaaag 

ggagaatggt 

cctcccacct 

atttttgtat 

gatctgcccg 

cccccaattt 

ctcctgagct 

agtcacagta 

gctcacgcct 

gatcaagacc 

aaccaggtgt 

tggcgtgaac 

ctgggcaaca 

tctggtccaa 

attgactagt 

tttgattata 

cttcttggat 

ttgcatcttg 

gcatacattg 

tgttcttttt 

cactgtttct 

tacagttact 

cactttacta 

atccatggtt 

tgaagcgggg 

tacaacctcc 

attataggca 

tcaccacatt 

ctcccaaaat 

tttaaggtaa 

acagtttcta 

atggagtctc 

cctctgcctc 

aggtgcgtgc 

gtgttagcca 

agtgctggaa 

agtctccctc 

aaccgtccag 

atgtgccacc 

tttctctgac 

gtgttgagat 

aatgggccac 

atttgaagat 

ttgttgctgt 

taaataaaga 

cttgtgcttt 

cgcagtggct 

aggtcaggag 

caaaaattaa 

caggagaatc 

actccagcct 

aagagtgtct 

ccaggctgtt 

aaccagaggt 



tggtggcagg 

cctgggaggc 

aaaagcaaaa 

ctttgttgcc 

ccaaagtgct 

gttgtatttc 

gacttactgg 

tttttctttc 

ggtgcacaat 

cagccttccc 

tttagtctta 

ccttggcctc 

ttttttttta 

caagctgtca 

cctggccttc 

gtaatcccag 

accctggcta 

ggtggcaggt 

ccgggaggtg 

gtgcgagact 

tagtttctga 

cacttgtctc 

atgtatcagt 

gtgtaaattc 

aaatactagt 

atgtgcttca 

ttctttctgc 

ttcttcttcc 

gtacttttta 

gtattctcta 

tcctttattt 

tttcactctt 

gcctcctggg 

tgtgccacca 

ggccagactg 

gctgggatta 

ttgattgaaa 

ttaatttctt 

actctatagc 

ctgggttcaa 

caccactcct 

ggatggtctc 

ttacaggtgt 

tctcacccag 

gcttaagtga 

atgcctggct 

tggtcttgaa 

tacaggtgtg 

actttcttgt 

gataatgtgg 

ttgttgtgga 

ctatattctg 

gaaagagatt 

cacgcctgta 

tttaagatca 

ccgggcatgg 

gcttcaatcc 

gggcaacaga 

tccatccttt 

cacggctttt 

gaaag.ttgag 



106980 

107040 

107100 

107160 

107220 

107280 

107340 

107400 

107460 

107520 

107580 

107640 

107700 

107760 

107820 

107880 

107940 

108000 

108060 

108120 

108180 

108240 

108300 

108360 

108420 

108480 

108540 

108600 

108660 

108720 

108780 

108340 

108900 

108960 

109020 

109080 

109140 

109200 

109260 

109320 

109380 

109440 

109500 

109560 

109620 

109680 

109740 

109800 

109860 

109920 

109980 

110040 

110100 

110160 

110220 

110280 

110340 

110400 

110460 

110520 

110580 
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ggcctcctca 
tttccaagca 
gccCcttcct 
acaacttcag 
gttatcccag 
ccagcctggc 
tggtgggtgc 
gggaggcgga 
gtgagactcc 
aggctgcccc 
acactagaaa 
tctgacacca 
tggtagtagt 
cttcattgag 
tcttgacagt 
gtttactgtg 
gctgccaact 
ttttaagtta 
caatgtactt 
agtttcactc 
cgcctcccag 
acccccacca 
ccaggctggt 
tgggattaca 
gatctacttc 
acctcccttt 
ctctggcata 
gacacagctc 
tagctccatc 
tccatcctcc 
tcctcccttc 
cccttcccct 
tctcctctga 
ctctgacata 
gacatagctc 
tagctccatc 
ttgccatata 
tcaaaagtat 
ttttgggggc 
catgacgaaa 
ctgtagtccc 
ggaagtcaag 
ttatctcaga 
gaggctgagg 
tgaaaccccg 
atcccagcta 
cagtgagcaa 
aaaaaaaaaa 
aatcccaaag 
agcctgggca 
aaaaagtata 
ttctttcctg 
atttagaatt 
aatggtgcag 
tcagcctccc 
gtatttttag 
tcatgatctg 
gcccatttag 
aggctggagt 
cattctcctg 
ctaatttttt 



ggtgtttctg 
tgaacaggag 
tcctaggctt 
gtagtatcta 
cactttggga 
caacatggtg 
ctgtaatctc 
ggttgcagtg 
atctcgggga 
agccctgaga 
gatccaaatg 
gcaagccaca 
ggctgagcag 
cagtcatctg 
ttaatagaaa 
acagttttga 
ccatcatttt 
ttttcttagt 
cagattatta 
tgtcacccag 
gttcacgcca 
cacccggcta 
ctcgaactgc 
ggtgtgagcg 
cgttgtacta 
tcctctgaca 
gctccatcct 
catcctccct 
ctcccttctc 
cttctcctct 
tcctatgaca 
atgacacagc 
catagctcca 
gctccatcct 
catcctccct 
ctcccttctc 
tacatttatg 
tttgaaagaa 
tgaggtgggc 
cccatctcca 
agctatttgg 
gctgcagtga 
agtaaaggga 
tgagtggatc 
tctctactaa 
ctcgggaggc 
gatcgcacca 
aaaaagaaaa 
ccttgggagg 
acatagcgag 
cttgtttttt 
tgaatttgag 
tttttttttt 
tctcagatca 
aagtagctga 
tagagacagg 
cctgtcctgg 
aatttctttt 
gcagtggcac 
cctcagcctc 
gtatttttag 



agcccccgtc 
ctttccaaag 
ttctgtcagc 
cctgcaaatg 
ggccgaggcg 
aagccccgtc 
age tact egg 
agctgagatc 
aaaaaaaaaa 
gagttcgagg 
cataagcagg 
tcagaaatac 
aaatagccca 
gttcgtagac 
aaagtttatt 
cagcagaata 
ggtgatatcc 
ggttgctcta 
ctaacttaac 
gctggagtgc 
ttctcctgcc 
attttgtatt 
tgacctcagg 
actgcactga 
tgacatagtt 
tagttccacc 
cccttctcct 
tctcctatga 
ctatgtcata 
ggcatagctc 
cagctccatc 
tccatcctcc 
tcctcccttc 
cccttctcct 
tctcctctga 
ctctgacata 
tatgttataa 
gttgcctgga 
agatcgcctg 
ccaaaattac 
gggaggatcc 
gctgagattg 
ctaggaatgg 
accggaggtc 
aaatacaaaa 
tgaggcaaga 
ctgcattaca 
gaaaagaaat 
ccaaggcagg 
actccatctc 
tgtatcatcc 
ttactgtcta 
tttgagacaa 
ctgcaacctc 
gactgcaggc 
gtttcactat 
cctcccaaaa 
tttttttttt 
gatctcggct 
ccgagtagct 
gagagatggg 



tagccccagc 
cccttagacc 
tctttgccca 
ccttcaggcc 
ggtgaattgc 
tctagtaaaa 
gaggctgaag 
gtgccattgc 
aaaaaaatgc 
gggtgaaaca 
attccttgag 
aggttgcctt 
gctgtcctcc 
accagagtta 
gtgacagttt 
gttgctttgc 
agctctgttg 
gagatgacaa 
acttaaagta 
aatggtgtga 
tcagcctcct 
tttagtagag 
tgatccgccc 
gcctaagtat 
ccccctccat 
ctccctcctc 
atgacacagc 
cacagctcca 
gctccatcct 
catcctccct 
ctcccttctc 
cttctcctct 
tcctctgaca 
ctgacatagc 
catagttcca 
gctccatccc 
cttcagctct 
ggcagtggct 
agccagggag 
.aaaaaattag 
cagctaaggt 
tgccactgca 
tggcttttat 
aggagtttaa 
agtagccggg 
gaatcgcttg 
gcctagatga 
taagatctag 
aggatcactt 
tatttaaaaa 
atattttacc 
gtgtcatttc 
agtctcactg 
tgcctccctg 
acctgccacc 
gttggccagg 
tgctgggatt 
ttttgagatg 
cactgcgagc 
gggattacag 
gtttcaccat 



tgtgtgcatg 
ttcacgtagc 
tctgttgttg 
aggcgcggtg 
ttgaggtcag 
atacaaaaat 
caggagaatt 
actccagcct 
catcaacagc 
aaggcaagcc 
aaaaggtctg 
ccccatggct 
tgaaatttag 
cagaaaagtt 
tgacagctga 
tggagagacg 
ctgaattttt 
tgtgcatctt 
cagcattttt 
tctcggctca 
gagtagctgg 
atgaggtttc 
atcttggcct 
ggcaacgtgt 
tttcctatag 
ctatgacgtc 
tccatcctcc 
tcctcccttc 
cccttctcct 
tctcctatga 
ctatgacaca 
ggcatagctc 
tagctccatc 
tccatcctcc 
tcctcccttg 
ctcttctcct 
tcagcgttat 
tatgccttta 
ttggagacca 
tctggcatgg 
gggaggatca 
ctccagcctg 
ctctaatccc 
gaccagcctg 
tgtggtggtg 
aacctgggaa 
cagagcgaga 
acactgtggt 
gaggccagga 
agaaagaaat 
tttctttttt 
cttttagtct 
tgttgcccag 
gttagagtga 
acccccagcc 
ctggtctcga 
accatgagcc 
gggtctcgct 
tccgcctccc 
gcgcctgcca 
gttagccagg 



gccttctgga 
tcttttccca 
tccctccccc 
gctcatacct 
gagttcgaga 
tagctgggcg 
gcttgagcct 
gggcgacaga 
acgaccctgg 
cttcagggag 
tatcatccct 
acatgtgagc 
cagggtctta 
tattgggagg 
atagaaaaaa 
gatctttgga 
agctatgctg 
taacttacca 
ttttttatgg 
ctgcaacctc 
gactacaggc 
accatgttgg 
cccaaagtgc 
ctataacata 
cacagtccca 
ctcccttctc 
cttctcctct 
tcctctgaca 
ctgacacagc 
cacagctcca 
gctccatcct 
catcctccct 
ctcccttctc 
cttctcctct 
tcctctgaca 
tcatgtatta 
aattattgct 
actccagcac 
gcctgggcaa 
tggcacgcgc 
cttgagcctg 
ggtgcagatc 
agcactttgg 
gccaacatgg 
ggtgtctgta 
gcggaggttg 
ctctgcctaa 
tcatgcctgt 
gttcaacacc 
tcaaagagaa 
ttgccccttt 
gaagaacttc 
gctggagtgc 
tt'ttcctgcc 
aatttttttg 
attcatgacc 
accacgccca 
cttgtttccc 
•gggttcacgc 
ccacgcccac 
atggtcttga 



110640 
110700 
110760 
110820 
110880 
110940 
1,11000 
111060 
111120 
111180 
111240 
111300 
111360 
111420 
111480 
111540 
111600 
111660 
111720 
111780 
111840 
111900 
111960 
.112020 
112080 
112140 
112200 
112260 
112320 
112380 
112440 
il2500 
112560 
112620 
112680 
112740 
112800 
112860 
112920 
112980 
113040 
113100 
113160 
113220 
113280 
113340 
113400 
113460 
113520 
113580 
113640 
113700 
113760 
113820 
113880 
113940 
114000 
114060 
114120 
114180 
114240 
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tctcctgacc 
gccaccgcgc 
tatctgggaa 
attattaatt 
cattgtttct 
tttttctctt 
tatctaggtg 
catagatcaa 
caaatttttt 
cgtgcgttct 



tcgtgatccg 
ccggctagaa 
tgtctttact 
gatcattctt 
gacgagaagt 
gatgttttca 
tggatctctt 
tgttttgttt 
tcctgcccct 
eta 



cccgccttgg 
tttcttgtag 
tcagcctcat 
ttcagtgttt 
catctgtcaa 
agattttctc 
ggtgtgcatc 
tgttttacca 
ctctcatctc 



cctcccaaag 
gacaggcttg 
tttttgaagg 
aaaagtgtca 
attgtccctt 
tttgtctttg 
gtatccgggc 
aatctggaga 
cttttgggag 



tgctgggatt 
ctagcaacca 
atagtttagc 
tcatgctacc 
tgtacttgaa 
gcctttagta 
ttcagtaagc 
gtttttactc 
taccactgca. 



acaggcgtga 
attcagtgtt 
tggccataga 
ttctgggttc 
gaattatctt 
gtttgtgatg 
ctcttagatt 
atcatttcaa 
tgtatgttgg 



114300 
114360 
114420 
114480 
114540 
114600 
114660 
114720 
114780 
114793 



<210> SEQ ID N0:4 
<211> LENGTH: 25 
<212> TYPE: DNA 

<213> ORGANISM:Arcif icial Sequence 



<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 



<400> SEQ ID NO: 4 
cacaggtcca gcatgtctgt gcgtc 

<210> SEQ ID NO: 5 
<211> LENGTH: 25 
<212> TYPE: DNA 

<213> ORGANISM:Artificial Sequence 



.<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO: 5 
cacagtccct gctggcctct gtcta 

<210> SEQ ID NO: 6 
<211> LENGTH: 25 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID N0:-6 
caggacatct ccatcaagag gctgc 

<210> SEQ ID NO: 7 
<211> LENGTH: 25 
<212> TYPE: DNA 

<213> ORGANISM:Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 



<400> SEQ ID NO: 7 
aataagaggg ggccaggatc agtgc 

<210> SEQ ID N0:8 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM:Artif icial Sequence 
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<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO: 8 
gtgaatggca tcctggagag 

<210> SEQ ID NO: 9 
<211> LENGTH:. 20 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO: 9 
gtctccaggc agctcaacag 

<210> SEQ ID NO:10 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO: 10 
accctgtccc tcctgtctga 

<210> SEQ ID NO: 11 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM:Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO: 11 
agaccctaag atgttcggag 

<210> SEQ ID NO: 12 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM; Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO: 12 
gatgacctgt gtgagttgcg 

<210> SEQ ID NO: 13 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO: 13 
cgcaactcac acaggtcatc 
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20 



20 - 



20 



20 



20 
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<210> SEQ ID NO: 14 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO: 14 
ggagtcaggt caaaggatgc 

<210> SEQ ID NO: 15 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO: 15 
gcatcctttg acctgactcc 

<210> SEQ ID NO: 16 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM:Artif icial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO: 16 
ggtctgaaac gtgatctggg 

<210> SEQ ID NO: 17 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM: Art if icial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO: 17 
cccagatcac gtctcagacc 

<210> SEQ ID NO: 18 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO: 18 
cgatgatgtg tgggttctcc 

<210> SEQ ID NO: 19 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM:Artificial Sequence . 
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<22 0> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID N0:19 
ggagaaccca cacatcatcg 

<210> SEQ ID NO:20 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE:' 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID N0:20 
cgtgtctgag aagtccagcc 

<210> SEQ ID N0:21 
<2il> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID N0:21 
ggctggactt ctcagacacg 

<210> SEQ ID NO:22 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM:Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO: 22 
acagcatctt ctccacgcac 

<210> SEQ ID NO:23 
<211> LENGTH: 2 0 
<212> TYPE: DNA 

<213> ORGANISM:Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO: 23 
agtcctctgg ctttgcagtg 

<210> SEQ ID NO: 24 
<211> LENGTH: 2 0 
<212> TYPE: DNA 

<213> ORGANISM:Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO: 24 
.tgtgcgtgga gaagatgctg 
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<210> SEQ ID NO: 25 
<211> LENGTH: 2 0 
<212> TYPE: DNA 

<213> ORGAKISM:Artif icial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO: 25 
ggctggaaag ggaagCctac 

<210> SEQ ID NO: 26 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISMrArtif icial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO: 26 
tggttcaggt gctcttgggg 

<210> SEQ ID NO: 27 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO: 27 
cgtgaagcag gagttgagcc 

<210> SEQ ID NO: 28 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM:Artif icial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO: 28 
atcttgctct gggtcttccc 

<210> SEQ ID NO:29 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM:Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO: 29 
cactgcaaag ccagaggact 

<210> SEQ ID NO:30 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM:Artificial Sequence 
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<22 0> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO: 30 
ataagcaaga cgacgacctc 

<210> SEQ ID NO: 31 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM:Artif icial Sequence 
<220> FEATtJRE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO: 31 
ctattctgtt gggtgggttc 

<210> SEQ ID NO: 32 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO:32 
cgtgcctcct gtgcttaccc 

<210> SEQ ID NO:33 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213>. ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO:33 
cagaccccaa ggtagctcag 

<210> SEQ ID NO:34 

<211> LENGTH: 20 

<212> TYPE: DNA 

<213> ORGANISM:Artificial Sequence 

<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO: 34 
ggaagaccca gagcaagatc 

<210> SEQ ID NO: 35 
<211> LENGTH: 780 
<212> TYPE: PRT 

<213> ORGANISM :Dictyostelim discoideum 
<400> SEQ ID NO: 35 

Met Lys Phe Tyr lie Glu Asp Leu Leu Val Tyr Phe Pro Tyr Ser Tyr 

1 5 10 15 

lie Tyr Pro Glu Glh Tyr Ser Tyr Met Val Ala Leu Lys Arg Ser Leu 
20 25 , 30 
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Asp Asn Gly Gly Pro Cys 


He 


Leu 


Glu 


Met 


Pro 


Ser Gly Thr 


Gly Lys 


35 








40 










45 








Thr Val Ser Leu 


Leu 


Ser 


Leu 


He 


Ser 


oer 


Tyr 


Gin 


Val 


Lys 


Asn 


Pro 


50 
















60 










Ser lie Lys Leu 


He 


Tyr 


Cys 


Ser 


Arg 


mr 


vaj. 


Pro 


Glu 


He 


Glu 


Gin 


65 




70 










f D 










80 


Ala Thr Glu Glu 


Ala 
85 


Arg 


Arg 


Val 


Leu 


vjxn 
y u 


Tyr 


Arg 


Asn 


Ser 


Glu 
95 


Met 


Gly Glu Glu Ser 


Pro 


Lys 


Thr 


Leu Cys 


Met 


Ser 


Met 


Ser 


Ser 


Arg 


Arg 


100 










105 










110 






Asn Leu Cys lie 


Gin 


Pro 


Arg 


Val 


Ser 


Glu 


Glu 


Arg Asp Gly 


Lys 


Val 


115 








120 










125 








Val Asp Ala Leu Cys 


Arg 


Glu 


Leu 


Thr 


Ser 


Ser 


Trp 


Asn 


Arg 


Glu 


Ser 


130 






135 










140 










Pro Thr Ser Glu 


Lys 


Cys 


Lys 


Phe 


Phe 


Glu 


Asn 


Phe 


Glu 


Ser 


Asn Gly 


145 




150 










155 










160 


Lys Glu lie Leu 


Leu 


Glu 


Gly Val Tyr 


Ser 


Leu 


Glu Asp Leu 


Lys 


Glu 


165 










170 










175 




Tyr Gly Leu Lys 


His 


Gin 


Met 


Cys 


Pro 


Tyr 


Phe 


Leu 


Ser 


Arg 


His 


Met 


130 










185 










190 






Leu Asn Phe Ala 


Asn 


He 


Val 


He 


Phe 


Ser Tyr 


Gin 


Tyr 


Leu 


Leu Asp 


195 








200 










205 








Pro Lys lie Ala 


Ser 


Leu 


He 


Ser 


Ser 


Ser 


Phe 


Pro 


Ser 


Asn 


Ser 


He 


210 






215 










220 










Val Val Phe Asp 


Glu 


Ala 


His 


Asn 


He 


Asp 


Asn 


Val 


Cys 


He 


Asn 


Ala 


225 




230 










235 










240 


Leu Ser lie Asn 


He 
245 


Asp 


Asn 


Lys 


Leu 


Leu 
250 


Asp 


Thr 


Ser 


Ser 


Lys 
255 


Asn 


lie Ala Lys lie 


Asn 


Lys 


Gin 


He 


Glu 


Asp 


He 


Lys 


Lys 


Val 


Asp 


Glu 


260 










265 










270 






Lys Arg Leu Lys 


Asp 


Glu 


Tyr 


Gin 


Arg 


Leu 


Val 


Asn Gly 


Leu 


Ala 


Arg 


275 








280 










285 








Ser Gly Ser Thr Arg Ala 


Asp 


Glu 


Thr 


Thr 


Ser 


Asp 


Pro 


Val 


Leu 


Pro 


290 






295 










300 










Asn Asp Val lie 


Gin 


Glu 


Ala 


Val 


Pro Gly Asn 


He 


Arg 


Lys 


Pro 


Ser 


305 




310 










315 










320 


lie Phe lie Ser 


Leu 


Leu 


Arg 


Arg 


Val 


Val 


Asp 


Tyr Leu 


Arg 


Glu 


Pro 




325 










330 










335 




Asp Lys Ser Arg 


Leu 


Lys 


Ser 


Gin 


Met 


Leu 


Leu 


Ser 


Glu 


Ser 


Pro 


Leu 


340 










345 










350 






Ala Phe Leu Gin Gly Leu 


Tyr 


His 


Ala 


Thr 


Gin 


He 


Ser 


Ser 


Arg 


Thr 


355 








360 










365 








Leu Arg Phe Cys 


Ser 


Ser 


Arg 


Leu 


Ser 


Ser 


Leu 


Leu 


Arg 


Thr 


Leu 


Arg 


370 






375 










380 










lie Asn Asp Val 


Asn 


Gin 


Phe 


Ser Gly 


He 


Ser 


Leu 


He 


Ala 


Asp 


Phe 


385 




390 










395 










400 


Ala Thr Leu Val Gly Thr 


Tyr Asn Asn Gly 


Phe 


Leu 


He 


He 


He 


Glu 




405 










410 










415 




Pro Tyr Tyr Gin 


Arg 


Gin 


Asn 


Asn 


Thr 


Tyr 


Asp 


Gin 


He 


Phe 


Gin 


Phe 


420 










425 










430 






Cys Cys Leu Asp Ala Ser 


He Gly Met 


Lys 


Pro 


He 


Phe 


Asp 


Lys 


Tyr 


435 








440 










445 








Arg Ser Val Val 


He 


Thr 


Ser Gly Thr 


Leu 


Ser 


Pro 


Leu 


Asp 


He 


Tyr 


450 






455 










460 










Thr Lys Met Leu 


Asn 


Phe 


Arg 


Pro 


Thr 


Val 


Val 


Glu 


Arg 


Leu 


Thr 


Met 


465 




470 










475 










480 


Ser Leu Asn Arg Asn Cys 


He 


Cys 


Pro 


Cys 


He 


Leu 


Thr 


Arg 


Gly Ser 




485 










490 










495 




Asp Gin lie Ser 


He 


Ser 


Thr 


Lys 


Phe 


Asp 


Val 


Arg 


Ser 


Asp 


Thr 


Ala 


500 










505 










510 
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Val 


Val 


Arg 


Asn 


Tyr 


Gly Ala 


Leu 


Leu 


Val 


Glu 


Val 


Ser 


Ala 


He 


Val 






515 










520 










525 








Pro 


Asp 
530 


Gly 


He 


He 


Cys 


Phe 
535. 


Phe 


Thr 


Ser 


Tyr 


Ser 
540 


Tyr 


Met 


Glu 


Gin 


He 


Val 


Ser 


Val 


Trp 


Asn 


Glu 


Met 


Gly Leu 


Leu 


Asn 


Asn 


He 


Leu 


Thr 


545 










550 










555 










560 


Asn 


Lys 


Leu 


He 


Phe 
565 


Val 


Glu 


Thr 


Ser 


Asp 
570 


Pro 


Ala 


Glu 


Ser 


Ala 
575 


Leu 


Ala 


Leu 


Gin 


Asn 


Tyr 


Lys 


Lys 


Ala Cys 


Asp 


Ser 


Gly 


Arg 


Gly 


Ala 


Val 








580 










585 










590 






Leu 


Leu 


Ser 


Val 


Ala 


Arg Gly 


Lys 


Val 


Ser 


Glu Gly 


He 


Asp 


Phe 


Asp 






595 










600 










605 








Asn 


Gin 
610 


Tyr 


Gly 


Arg 


Cys 


Val 
615 


He 


Leu 


Tyr 


Gly 


He 
620 


Pro 


Tyr 


He 


Asn 


Thr 


Glu 


Ser 


Lys 


Val 


Leu 


Arg 


Ala 


Arg 


Leu 


Glu 


Phe 


Leu 


Arg 


Asp 


Arg 


625 










630 










635 










640 


Tyr 


Gin 


He 


Arg 


Glu 
645 


Asn 


Glu 


Phe 


Leu 


Thr 
650 


Phe 


Asp 


Ala 


Met 


Arg 
655 


Thr 


Ala 


Ser 


Gin 


Cys 


Val 


Gly Arg 


Val 


He 


Arg 


Gly Lys 


Ser 


Asp 


Tyr 


Gly 








660 










665 










670 






He 


Met 


He 
675 


Phe 


Ala 


Asp 


Lys 


Arg 
680 


Tyr 


Asn 


Arg 


Leu 


Asp 
685 


Lys 


Arg 


Asn 


Lys 


Leu 
690 


Pro 


Gin 


Trp 


He 


Leu 
695 


Gin 


Phe 


Cys 


Gin 


Pro 
700 


Gin 


His 


Leu 


Asn 


Leu 


Ser 


Thr 


Asp 


Met 


Ala 


He 


Ser 


Leu 


Ser 


Lys 


Thr 


Phe 


Leu 


Arg 


Glu 


705 










710 










715 










720 


Met 


Gly 


Gin 


Pro 


Phe 


Ser 


Arg 


Glu 


Glu 


Gin 


Leu Gly 


Lys 


Ser 


Leu 


Trp 










725 










730 










735 




Ser 


Leu 


Glu 


His 
740 


Val 


Glu 


Lys 


Gin 


Ser 
745 


Thr 


Ser 


Lys 


Pro 


Pro 
750 


Gin 


Gin 


Gin 


Asn 


Ser 
755 


Ala 


He 


Asn 


Ser 


Thr 
760 


He 


Thr 


Thr 


Ser 


Thr 
765 


Thr 


Thr 


Thr 


Thr 


Thr 
770 


Thr 


Ser 


Thr 


lie 


Ser 
775 


Glu 


Thr 


His 


Leu 


Thr 
780 











<210> SEQ ID NO:36 

<211> LENGTH: 778 

<212> TYPE: PRT 

<213> ORGANISM: S. cerevisiae 



<400> SEQ ID NO: 36 



Met 


Lys 


Phe 


Tyr 


He 


Asp 


Asp 


Leu 


Pro 


Val 


Leu 


Phe 


Pro 


Tyr 


Pro 


Lys 


1 








5 










10 










15 




He 


Tyr 


Pro 


Glu 


Gin 


Tyr 


Asn 


Tyr 


Met 


Cys 


Asp 


He 


Lys 


Lys 


Thr 


Leu 








20 










25 










30 






Asp 


Val 


Gly 


Gly 


Asn 


Ser 


He 


Leu 


Glu 


Met 


Pro 


Ser 


Gly 


Thr 


Gly 


Lys 






35 










40 










45 








Thr 


Val 


Ser 


Leu 


Leu 


Ser 


Leu 


Thr 


He 


Ala 


Tyr 


Gin 


Met 


His 


Tyr 


Pro 




50 










55 










60 










Glu 


His 


Arg 


Lys 


He 


He 


Tyr 


Cys 


Ser 


Arg 


Thr 


Met 


Ser 


Glu 


He 


Glu 


65 










70 










75 










80 


Lys 


Ala 


Leu 


Val 


Glu 


Leu 


Glu 


Asn 


Leu 


Met 


Asp 


Tyr 


Arg 


Thr 


Lys 


Glu 










85 










90 










95 




Leu Gly 


Tyr 


Gin 


Glu 


Asp 


Phe 


Arg Gly Leu 


Gly Leu 


Thr 


Ser 


Arg 


Lys 








.100 










105 










110 






Asn 


Leu 


Cys 


Leu 


His 


Pro 


Glu 


Val 


Ser 


Lys 


Glu 


Arg 


Lys 


Gly 


Thr 


Val 






115 










120 










125 








Val 


Asp 


Glu 


Lys 


Cys 


Arg 


Arg 


Met 


Thr 


Asn 


Gly Gin 


Ala 


Lys 


Arg 


Lys 




130 










135 










140 










Leu. Glu 


Glu 


Asp 


Pro 


Glu 


Ala 


Asn 


Val 


Glu 


Leu Cys 


Glu 


Tyr 


His 


Glu 



48 



wo 01/42434 PCT/US00/33t)65 



145 150 155 160 

Asn Leu Tyr Asn lie Glu Val Glu Asp Tyr Leu Pro Lys Gly Val Phe 

165 170 175 

Ser Phe Glu Lys Leu Leu Lys Tyr Cys Glu Glu Lys Thr Leu Cys Pro 

180 185 190 

Tyr Phe lie Val Arg Arg Met He Ser Leu Cys Asn He He He Tyr 

195 200 205 

Ser Tyr His Tyr Leu Leu Asp Pro Lys lie Ala Glu Arg Val Ser Asn 

210 215 220 

Glu Val Ser Lys Asp Ser He Val He Phe Asp Glu Ala His Asn He 
225 230 235 240 

Asp. Asn Val Cys He Glu Ser Leu Ser Leu Asp Leu Thr Thr Asp Ala 

245 250 255 

Leu Arg Arg Ala Thr Arg Gly Ala Asn Ala Leu Asp Glu Arg. He Ser 

260 265 270 

Glu Val Arg Lys Val Asp Ser Gin Lys Leu Gin Asp Glu Tyr Glu Lys 

275 280 285 

Leu Val Gin Gly Leu His Ser Ala Asp He Leu Thr Asp Gin Glu Glu 

290 295 300 

Pro Phe Val Glu Thr Pro Val Leu Pro Gin Asp Leu Leu Thr Glu Ala 
305 310 315 320 

He Pro Gly Asn He Arg Arg Ala Glu His Phe Val Ser Phe Leu Lys 

325 330 335 

Arg Leu He Glu Tyr Leu Lys- Thr Arg Met Lys Val Leu His Val He 

340 345 350 

Ser Glu Thr Pro Lys Ser Phe Leu Gin His Leu Lys Gin Leu Thr Phe 

355 360 365 

He Glu Arg Lys Pro Leu Arg Phe Cys Ser Glu Arg Leu Ser Leu Leu 

370 375 380 

Val Arg Thr Leu Glu Val Thr Glu Val Glu Asp Phe Thr Ala Leu Lys 
385 390 395 400 

Asp He Ala Thr Phe Ala Thr Leu He Ser Thr Tyr Glu Glu Gly Phe 

405 410 415 

Leu Leu He He Glu Pro Tyr Glu He Glu Asn Ala Ala Val Pro Asn 

420 425 430 

Pro He Met Arg Phe Thr Cys Leu Asp Ala Ser He Ala He Lys Pro 

435 440 445 

Val Phe Glu Arg Phe Ser Ser Val He He Thr Ser Gly Thr He Ser 

450 455 460 

Pro Leu Asp Met Tyr Pro Arg Met Leu Asn Phe Lys Thr Val Leu Gin 
465 470 475 480 

Lys Ser Tyr Ala Met Thr Leu Ala Lys Lys Ser Phe Leu Pro Met He 

485 490 495 

He Thr Lys Gly Ser Asp Gin Val Ala He Ser Ser Arg Phe Glu He 

500 505 510 

Arg Asn Asp Pro Ser He Val Arg Asn Tyr Gly Ser Met Leu Val Glu 

515 520 525 

Phe Ala Lys He Thr Pro Asp Gly Met Val Val Phe Phe Pro Ser Tyr 

530 535 540 

Leu Tyr Met Glu Ser He Val Ser Met Trp Gin Thr Met Gly He Leu 
545 550 555 560 

Asp Glu Val Trp Lys His Lys Leu He Leu Val Glu Thr Pro Asp Ala 

565 570 575 

Gin Glu Thr Ser Leu Ala Leu Glu Thr Tyr Arg Lys Ala Cys Ser Asn 

580 585 590 

Gly Arg Gly Ala He Leu Leu Ser Val Ala Arg Gly Lys Val Ser Glu 

595 600 605 

Gly He Asp Phe Asp His Gin Tyr Gly Arg Thr Val Leu Met He Gly 
610 615 620 
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He Pro Phe Gin Tyr Thr Glu Ser Arg He Leu Lys Ala Arg Leu Glu 
625 630 635 640 

Phe Met Arg Glu Asn Xyr Arg He Arg Glu Asn Asp Phe Leu Ser Phe 

645 650 655 

Asp Ala Met Arg His Ala Ala Gin Cys Leu Gly Arg Val Leu Arg Gly 

660 665 670 

Lys Asp Asp Tyr Gly Val Met Val Leu Ala Asp Arg Arg Phe Ser Arg 

675 680 685 

Lys Arg Ser Gin Leu Pro Lys Trp He Ala Gin Gly Leu Ser Asp Ala 

690 695 700 

Asp Leu Asn Leu Ser Thr Asp Met Ala He Ser Asn Thr Lys Gin Phe 
705 710 715 720 

Leu Arg Thr Met Ala Gin Pro Thr Asp Pro Lys Asp Gin Glu Gly Val 

725 730 735 

Ser Val Trp Ser Tyr Glu Asp Leu He Lys His Gin Asn Ser Arg Lys 

740 745 750 

Asp Gin Gly Gly Phe lie Glu Asn Glu Asn Lys Glu Gly Glu Gin Asp 

755 760 . 765 

Glu Asp Glu Asp Glu Asp He Glu Met Gin 
770 775 

<210> SEQ ID NO: 37 
<211> LENGTH: 772 . 
<212> TYPE: PRT 
<213> ORGANISM: S. pombe 

. <400> SEQ ID NO: 37 

Met Lys Phe Tyr He Asp Asp Leu Pro He Leu Phe Pro Tyr Pro Arg 
1 5 10 15 - 

He Tyr Pro. Glu Gin Tyr Gin Tyr Met Cys Asp Leu Lys His Ser Leu 

20 25 30 

Asp Ala Gly Gly He Ala Leu Leu Glu Met Pro Ser Gly Thr Gly Lys 

35 40 45 

Thr He Ser Leu Leu Ser Leu He Val Ser Tyr Gin Gin His Tyr Pro 

50 55 60 

Glu His Arg Lys Leu He Tyr Cys Ser Arg Thr Met Ser Glu He Asp 
65 70^ 75 80 

Lys Ala Leu Ala Glu Leu Lys Arg Leu Met Ala Tyr Arg Thr Ser Gin 

85 90 95 

Leu Gly Tyr Glu Glu Pro Phe Leu Gly Leu Gly Leu Thr Ser Arg Lys 

100 105 110 

Asn Leu Cys Leu His Pro Ser Val Arg Arg Glu Lys Asn Gly Asn Val 

115 120 125 

Val Asp Ala Arg Cys Arg Ser Leu Thr Ala Gly Phe Val Arg Glu Gin 

130 135 140 

Arg Leu Ala Gly Met Asp Val Pro Thr Cys Glu Phe His Asp Asn Leu 
145 150 155 160 

Glu Asp Leu Glu Pro His Ser Leu He Ser Asn Gly Val Trp Thr Leu 

165 170 . 175 

Asp Asp He Thr Glu Tyr Gly Glu Lys Thr Thr Arg Cys Pro Tyr Phe 

180 185 190 

Thr Val Arg Arg Met Leu Pro Phe Cys Asn Val He He Tyr Ser Tyr 

195 200 205 

His Tyr Leu Leu Asp Pro Lys He Ala Glu Arg Val Ser Arg Glu Leu 

210 215 220 

Ser Lys Asp Cys He Val Val Phe Asp Glu Ala His Asn He Asp Asn 
225 230 235 240 

Val Cys He Glu Ser Leu Ser He Asp Leu Thr Glu Ser Ser Leu Arg 
245 250 255 
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Lys 


Ala 


Ser 


Lys 


Ser 


He 


Leu 


Ser 


Leu 








260 










265 


Lys 


Gin 


Ser 


Asp 


Ser 


Lys- 


Lys 


Leu 


Gin 






275 










280 




Arg Gly Leu Gin Asp Ala 


Asn 


Ala 


Ala 




290 










295 






Ala 


Asn 


Pro 


Val 


Leu 


Pro 


Glu 


Asp Val 


305 










310 








Asn 


He Arg Arg Ala Glu 


His 


Phe 


He 










325 










Glu 


Tyr 


Leu 


Lys 


Thr 


Arg 


Met 


Lys 


Val 








340 










345 


Pro 


Thr 


Ser 


Phe 


Leu 


Gin 


His 


Val 


Lys 






355 










360 




Lys 


Pro 


Leu 


Arg 


Phe 


Cys 


Ala 


Glu. 


Arg 




370 










375 






Leu 


Gin 


lie 


Ser 


Leu 


Val 


Glu 


Asp 


Phe 


385 










390 








Ala 


Phe 


Ala 


Thr 


Leu 


Val 


Ala 


Thr 


Tyr 










405 










Leu 


Glu 


Pro 


Phe 


Glu 


Thr 


Glu 


Asn 


Ala 








420 










425 


Arg 


Phe 


Ser Cys 


Leu 


Asp 


Ala 


Ser 


He 






435 










440 




Arg 


Phe 


Arg 


Ser 


Val 


He 


He 


Thr 


Ser 




450 










455 






Met 


Tyr 


Pro 


Lys 


Met 


Leu 


Gin 


Phe 


Asn 


465 










470 








Gly Met 


Ser 


Leu 


Ala 


Arg 


Asn 


Cys 


Phe 










485 










Gly 


Ser 


Asp 


Gin 


Val 


Ala 


He 


Ser 


Ser 








500 










505 


Pro 


Ser 


Val 


Val 


Arg 


Asn 


Tyr 


Gly Asn 






515 










520 




He 


Thr 


Pro Asp Gly Leu 


Val 


Ala 


Phe 




530 










535 






Glu 


Ser 


He 


Val 


Ser 


Ser 


Trp 


Gin 


Ser 


545 










550 








Trp 


Lys 


Tyr 


Lys 


Leu 


He 


Leu 


Val 


Glu 










565 










Thr 


Leu 


Ala 


Leu 


Glu 


Thr 


Tyr 


Arg 


Ala 








580 










585 


Ala 


Val 


Leu 


Leu 


Ser 


Val 


Ala 


Arg 


Gly 






595 










600 




Phe 


Asp 


His 


His Tyr Gly 


Arg 


Ala 


Val 




610 










615 






Gin 


Tyr 


Thr 


Glu 


Ser 


Arg 


Val 


Leu 


Lys 


625 










630 








Asp 


Thr 


Tyr 


Gin 


He 


Arg 


Glu 


Ala 


Asp 










645 










Arg His Ala Ala Gin Cys 


Leu 


Gly Arg 








660 










665 


His 


Gly He Met Val 


Leu 


Ala 


Asp 


Lys 






675 










680 




Arg 


Thr 


Lys 


Leu 


Pro 


Lys 


Trp 


He 


Gin 




690 










695 






Thr 


Asn 


Leu 


Ser 


Thr 


Asp 


Met 


Ser 


Leu 


705 










710 








Arg 


Thr 


Met 


Ala 


Gin 


Pro 


Phe 


Thr 


Ala 
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Glu 


Gin 


Lys 


Val 


Asn 


Glu 


Val 










270 






Asp 


Glu 


Tyr 


Gin 


Lys 


Leu 


Val 








285 








Asn 


Asp 


Glu 


Asp 


Gin 


Phe 


Met 






300 










Leu 


Lys 


Glu 


Ala 


Val 


Pro Gly 




315 










320 


Ala 


Phe 


Leu 


Lys 


Arg 


Phe 


Val 


330 










335 




Leu 


His 


Val 


He 


Ala 


Glu 


Thr 










350 






Asp 


He 


Thr 


Phe 


He 


Asp 


Lys 








365 








Leu 


Thr 


Ser 


Leu 


Val 


Arg 


Ala 






380 










His 


Ser 


Leu 


Gin 


Gin 


Val 


Val 




395 










400 


Glu 


Arg 


Gly 


Phe 


He 


Leu 


He 


410 










415 




Thr 


Val 


Pro 


Asn 


Pro 


He 


Leu 










430 






Ala 


He 


Lys 


Pro 


Val 


Phe 


Glu 








445 








Gly 


Thr 


Leu 


Ser 


Pro 


Leu 


Asp 






460 










Thr 


Val 


Met 


Gin 


Glu 


Ser 


Tyr 




475 










480 


Leu 


Pro 


Met 


Val 


Val 


Thr 


Arg 


490 










495 




Lys 


Phe 


Glu 


Ala 


Arg Asn Asp 










510 






He 


Leu 


Val 


Glu 


Phe 


Ser 


Lys 








525 








Phe 


Pro 


Ser 


Tyr 


Leu Tyr 


Leu 






540 










Met 


Gly 


He 


Leu 


Asp 


Glu 


Val 




555 










560 


Thr 


Pro 


Asp 


Pro 


His 


Glu 


Thr 


570 










575 




Ala 


Cys 


Ser 


Asn 


Gly Arg Gly 










590 






Lys 


Val 


Ser 


Glu 


Gly Val 


Asp 








605 








He 


Met 


Phe 


Gly 


He 


Pro 


Tyr 






620 










Ala 


Arg 


Leu 


Glu 


Phe 


Leu 


Arg 




635 










640 


Phe 


Leu 


Thr 


Phe 


Asp 


Ala 


Met 


650 










655 




Val 


Leu 


Arg 


Gly 


Lys 


Asp Asp 










670 






Arg 


Tyr 


Gly 


Arg 


Ser Asp 


Lys 








685 








Gin 


Tyr 


He 


Thr 


Glu 


Gly Ala 






700 










Ala 


Leu 


Ala 


Lys 


Lys 


Phe 


Leu 




715 










720 


Ser 


Asp 


Gin 


Glu 


Gly 


He 


Ser 


730 










735 
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Trp Trp 


Ser 


Leu 


Asp 


Asp 


Leu Leu 


He 


His 


Gin 


Lys 


Lys 


Ala 


Leu 


Lys 






740 








745 










750 






Ser Ala 


Ala 


He 


Glu 


Gin 


Ser Lys 


His 


Glu 


Asp 


Glu 


Met 


Asp 


He 


Asp 




755 








760 










765 








Val Val 


Glu 


Thr 
























770 




























<210> SEQ ID NO: 


.38 






















<211> LENGTH: 760 






















<212> TYPE: 


PRT 
























<213> ORGANISM: Homo 


sapien 


















<400> SEQ ID NO: 


J o 






















Met Lys 


Leu 


Asn 


Val 


Asp 


Gly Leu 


Leu 


Val 


Tyr 


Phe 


Pro 


Tyr 


Asp 


Tyr 


1 






c 
b 








10 










15 




lie Tyr 


Pro 


Glu 


Gin 


Pne 


Ser Tyr 


Met 


Arg 


Glu 


Leu 


Lys 


Arg 


Thr 


Leu 






20 








25 










30 






Asp Ala 


Lys 


Gly 


His 


Gly 


Val Leu 


Glu 


Met 


Pro 


Ser 


Gly 


Thr 


Gly 


Lys 




35 


















45 








Thr Val 


Ser 


Leu. 


Leu 


Ala 


Leu lie 


Met 


Ala 


Tyr 


Gin 


Arg 


Ala 


Tyr 


Pro 


50 










c c 








60 










Leu . Glu 


Val 


Thr 


Lys 


Leu 


lie Tyr 


Cys 


Ser 


Arg 


Thr 


Val 


Pro 


Glu 


He 


65 
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75 










80 


Glu Lys 


Val 


He 


Glu 


Glu 


Leu Arg 


Lys 


Leu 


Leu 


Asn 


Phe 


Tyr 


Glu 


Lys 








Q C 








90 










95 




Gin Glu 


Gly 


Glu 


Lys 


Leu 


Pro Pne 


Leu 


Gly 


Leu 


Ala 


Leu 


Ser 


Ser 


Arg 






100 








105 










110 






Lys Asn 


Leu 


Cys 


lie 


His 


Pro Glu 


Val 


Thr 


Pro 


Leu 


Arg 


Phe 


Gly 


Lys 




115 


















125 








Asp Val 


Asp 


Gly 


Lys 


Cys 


nxs o€r 


Leu 


Thr 


Ala 


Ser 


Tyr 


Val 


Arg 


Ala 


130 


















140 










Gin Tyr 


Gin 


His 


Asp 


inr 


Ser Leu 


Pro 


His 


Cys 


Arg 


Phe 


Tyr 


Glu 


Glu 


145 
















155 










160 


Phe Asp 


Ala 


His 


Vjxy 


Arg 




Pro 


Leu 


Pro 


Ala 


Gly 


He 


Tyr 


Asn 








J. o ^ 








170 










175 




Leu Asp 


Asp 


Leu 


Lys 


/\j.a 


Leu Gly 


Arg 


Arg 


Gin 


Gly 


Trp 


Cys 


Pro 


Tyr 






180 








185 










.190 






Phe Leu 


Ala 


Arg 


Tyr 


Cor- 
OCX 


lie Leu 


His 


Ala 


Asn 


Val 


Val 


Val 


Tyr 


Ser 




195 








200 










205 








Tyr His 


Tyr 


Leu 


Leu 


Asp 


Pro Lys 


He 


Ala 


Asp 


Leu 


Val. 


Ser 


Lys 


Glu 


210 










215 








220 










Leu Ala 


Arg 


Lys 


Ala 


Val 


•Val Val 


Phe 


Asp 


Glu 


Ala 


His 


Asn 


He 


Asp 


225 








230 








235 










240 


Asn Val 


Cys 


He 


Asp 


Ser 


Met Ser 


Val 


Asn 


Leu 


Thr 


Arg 


Arg 


Thr 


Leu 








245 








250 










255 




Asp Arg 


Cys 


Gin 


Gly 


Asn 


Leu Glu 


Thr 


Leu 


Gin 


Lys 


Thr 


Val 


Leu 


Arg 






260 








265 










270 






lie Lys 


Glu 


Thr 


Asp 


Glu 


Gin Arg 


Leu 


Arg 


Asp 


Glu 


Tyr 


Arg 


Arg 


Leu 




275 








280 










285 








Val Glu 


Gly 


Leu 


Arg 


Glu 


Ala Ser 


Ala 


Ala 


Arg 


Glu 


Thr 


Asp 


Ala 


His 


290 










295 








300 










Leu Ala 


Asn 


Pro 


Val 


Leu 


Pro Asp 


Glu 


Val 


Leu 


Gin 


Glu 


Ala 


Val 


Pro 


305 








310 








315 










320 


Gly Ser 


He 


Arg 


Thr 


Ala 


Glu His 


Phe 


Leu 


Gly 


Phe 


Leu 


Arg 


Arg 


Leu 








325 








330 










335 




Leu Glu 


Tyr 


Val 


Lys 


Trp 


Arg Leu 


Arg 


Val 


Gin 


His 


Val 


Val 


Gin 


Glu 






340 








345 










350 






Ser Pro 


Pro 


Ala 


Phe 


Leu 


Ser Gly 


Leu 


Ala 


Gin 


Arg 


Val 


Cys 


He 


Gin 




355 








360 










365 








Arg Lys 


Pro 


Leu 


Arg 


Phe 


Cys Ala 


Glu 


Arg 


Leu 


Arg 


Ser 


Leu 


Leu 


His 
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370 



375 



380 



Thr Leu Glu lie Thr Asp Leu Ala Asp Phe Ser Pro Leu Thr Leu Leu 
385 390 395 400 

Ala Asn Phe Ala Thr Leu Val Ser Thr Tyr Ala Lys Gly Phe Thr He 

405 410 415 

lie He Glu Pro Phe Asp Asp Arg Thr Pro Thr He Ala Asn Pro He 

420 425 430 

Leu His Phe Ser Cys Met Asp Ala Ser Leu Ala He Lys Pro Val Phe 

435 440 445 

Glu Arg Phe Gin Ser Val He He Thr Ser Gly Thr Leu Ser Pro Leu 

450 455 460 

Asp He Tyr Pro Lys He Leu Asp Phe His Pro Val Thr Met Ala Thr 
465 470 475 480 

Phe Thr Met Thr Leu Ala Arg Val Cys Leu Cys Pro Met He He Gly 

485 490 495 

Arg Gly Asn Asp Gin Val Ala He Ser Ser Lys Phe Glu Thr Arg Glu 

500 505 510 

Asp He Ala Val He Arg Asn Tyr Gly Asn Leu Leu Leu Glu Met Ser 

515 520 525 

Ala Val Val Pro Asp Gly He Val Ala Phe Phe Thr Ser Tyr Gin Tyr 

530 535 540 

Met Glu Ser Thr Val Ala Ser Trp Tyr Glu Gin Gly He Leu Glu Asn 
545 550 555 560 

He Gin Arg Asn Lys Leu Leu Phe He Glu Thr Gin Asp Gly Ala Glu 

565 570 575 

Thr Ser Val Ala Leu Glu Lys Tyr Gin Glu Ala Cys Glu Asn Gly Arg 

580 585 590' 

Gly Ala He Leu Leu Ser Val Ala Arg Gly Lys Val Ser Glu Gly He 

595 600 605 

Asp Phe Val His His Tyr Gly Arg Ala Val He Met Phe Gly Val Pro 

610 615 620 

Tyr Val Tyr Thr Gin Ser Arg He Leu Lys Ala Arg. Leu Glu Tyr_ Leu 
625 630 635 640 

Arg Asp Gin Phe Gin He Arg Glu Asn Asp. Phe Leu Thr Phe Asp Ala 

645 650 655 

Met Arg His Ala Ala Gin Cys Val Gly Arg Ala He Arg Gly Lys Thr 

660 665 670 

Asp Tyr Gly Leu Met Val Phe Ala Asp Lys Arg Phe Ala Arg Gly Asp 

675 680 685 

Lys Arg Gly Lys Leu Pro Arg Trp He Gin Glu His Leu Thr Asp Ala 

690 695 700 

Asn Leu Asn Leu Thr Val Asp Glu Gly Val Gin Val. Ala Lys Tyr Phe 
705 710 715 720 

Leu Arg Gin Met Ala Gin Pro Phe His Arg Glu Asp Gin Leu Gly Leu 

725 730 735 

Ser Leu Leu Ser Leu Glu Gin Leu Glu Ser Glu Glu Thr Leu Lys Arg 

740 745 750 

He Glu Gin He Ala Gin Gin Leu 



755 



760 
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